About Us
CLOUDSUFI, a Google Cloud Premier Partner, is a leading global provider of data-driven digital transformation for cloud-based enterprises. With a global presence and a focus on Software & Platforms, Life Sciences and Healthcare, Retail, CPG, Financial Services, and Supply Chain, CLOUDSUFI is positioned to meet customers where they are in their data monetization journey.
Our Values
We are a passionate and empathetic team that prioritizes human values. Our purpose is to elevate the quality of life for our family, customers, partners and the community.
Equal Opportunity Statement
CLOUDSUFI is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. All qualified candidates receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, or national origin. We provide equal opportunities in employment, advancement, and all other areas of our workplace. Learn more at https://www.cloudsufi.com/
Must-have
- 5+ years of data engineering experience, with at least 2 years working on connector or integration framework development
- Deep Python expertise including PySpark, pyarrow, and an understanding of Spark's execution model (driver vs executor, serialization constraints, partition fan-out)
- Hands-on experience with at least one SaaS ingestion platform — Fivetran, Airbyte, Google DTS, AWS Glue connectors, or equivalent — at the connector-build level, not just configuration
- Strong understanding of OAuth 2.0 flows (auth code, PKCE, client credentials, JWT), rate-limiting strategies (token bucket, leaky bucket, per-endpoint quotas), and incremental sync patterns (cursor, watermark, CDC)
- Experience designing shared connector frameworks — reusable auth managers, rate governors, state stores — not just per-connector scripts
- Ability to author and own TDDs and PRDs that can be handed to a junior engineer with minimal back-and-forth
Nice-to-have
- Prior exposure to Databricks Asset Bundles / Declarative Automation Bundles or Lakeflow pipelines
- Experience with the Databricks Python Data Source API (DBR 15.4 LTS+); this is extremely rare, so a practical Spark DSv2 Java/Scala background is treated as equivalent
- GCP DTS or Cloud Data Fusion connector experience
- Knowledge of specific source systems, particularly social ads APIs (Meta, LinkedIn, X) or enterprise SaaS (Salesforce, Oracle)
Behavioural Competencies
- Should have very good verbal and written communication, technical articulation, listening and presentation skills
- Should have proven analytical and problem-solving skills
- Should have demonstrated effective task prioritization, time management and internal/external stakeholder management skills
- Should be a quick learner, self-starter, go-getter and team player
- Should have experience working under stringent deadlines in a matrix organization structure