About Us
CLOUDSUFI, a Google Cloud Premier Partner, is a global leading provider of data-driven digital transformation across cloud-based enterprises. With a global presence and focus on Software & Platforms, Life sciences and Healthcare, Retail, CPG, financial services and supply chain, CLOUDSUFI is positioned to meet customers where they are in their data monetization journey.
Our Values
We are a passionate and empathetic team that prioritizes human values. Our purpose is to elevate the quality of lives for our family, customers, partners and the community.
Equal Opportunity Statement
CLOUDSUFI is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. All qualified candidates receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation and national origin status. We provide equal opportunities in employment, advancement, and all other areas of our workplace. Please explore more at
https://www.cloudsufi.com/
Must-have
- 3+ years of Python data engineering; strong Python fundamentals including class design, serialization, error handling, and test writing
- Experience integrating with REST APIs — building HTTP clients, handling pagination, managing auth token lifecycle, implementing retry logic
- Familiarity with pyarrow or at least columnar data formats (Parquet, Arrow)
- Ability to read and interpret API documentation for an unfamiliar source system and produce a working extraction implementation within 1–2 weeks
- Understanding of incremental sync concepts — watermarks, cursors, full-refresh vs delta patterns
- Proficiency with pytest — unit tests with mocks, fixture-based testing, parametrize
Nice-to-have
- PySpark experience (even basic DataFrame operations)
- Experience with pydantic for API response modeling
- Exposure to at least one Social Ads API (Meta, LinkedIn, X, TikTok)
- Experience with requests-oauthlib or similar OAuth libraries