
Search by job, company or skills
Experience: 5+ years building and operating production-grade Python services.
Location: Remote
To streamline and fast-track screening, please submit your details here (if you haven't already): https://airtable.com/appbtkr4odapnb5I6/pag8eyxvIdQ5YQCku/form
We'll review your responses as part of the initial screening process. Please make sure you complete and submit all details through the form to be considered for the next stage. Submissions outside the form may not be considered.
Why This Role Matters
Every insight Terrabase delivers travels through a Python service you will own. Our platform powers real-time agent workflows, multi-connector data pipelines, sandboxed execution, and versioned artifact delivery, all streaming live to enterprise customers. Reliable async workers, low-latency APIs, and precise observability are not nice-to-haves here. They decide whether customers trust the system.
Your mission: keep this engine reliable and scale it as we grow.
What You Will Do
Own the FastAPI platform. Design, extend, and operate the core services powering agent orchestration, connector management, schema resolution, streaming chat, and sandboxed execution. Async handlers, SSE and WebSocket support, Pydantic v2 validation, SQLAlchemy with Alembic migrations against PostgreSQL.
Build and scale async workers. Operate Celery workers backed by Redis and RabbitMQ for schema fetching, task routing, stuck-task detection, and real-time notifications. Understand failure modes at the worker level, not just the API level.
Own the context layer pipeline. Build and operate the ingestion pipeline that processes enterprise documents, extracts and ranks business concepts, and builds the structured knowledge layer that agents reason over. This covers connector integrations, chunking strategies, and the data contracts between upstream sources and the agent layer.
Manage data connections at scale. Build and harden runtime connectors to Snowflake, DuckDB, Databricks, BigQuery, and other warehouse and SaaS sources. Handle encrypted credentials, OAuth flows, and live schema discovery. Make connections stay alive, fail cleanly, and recover fast.
Instrument everything. Own the observability stack: Prometheus and Grafana, structured logging with correlation IDs, OpenTelemetry tracing, health endpoints. P99 latency and error budgets are yours to define and defend.
Ship and operate on AWS. Docker-based deployments, Nginx, Terraform, GitHub Actions CI/CD. Write runbooks and post-mortems anyone can use to debug at 2am. Harden secrets management and SOC 2 logging.
Collaborate across teams. The platform serves LangGraph-based agent workflows and React frontends. Design API contracts that enable sub-second streaming responses and zero-downtime releases.
What We Are Looking For
Bonus Points
Life at Terrabase
Sharp, fully remote team shipping to enterprise customers weekly. Real ownership, generous cloud budgets, and a culture that prizes reliability over ceremony.
Terrabase is an equal-opportunity employer. We celebrate diversity and are committed to building an inclusive environment for every team member.
Job ID: 150038473
Skills:
Hadoop, Kafka, Tensorflow, Gcp, Pytorch, Docker, Spark, Azure, Python, Kubernetes, AWS, Performance Optimization, Infrastructure as Code, AI ML Technologies, MLflow, TensorBoard, CI CD, Hybrid Cloud Architecture, Kubeflow
Skills:
Design Patterns, Oops, Hibernate, CSS, Oracle Sql, PostgreSQL, Spring Boot, Soap, Edi, HTML, Spring, Java 8, REST, Gcp, Javascript, MySQL, Reactjs, Sql Database, Oracle, Azure, Python, AWS
Skills:
Hibernate, MySQL, PostgreSQL, Spring Boot, Restful Apis, Java 11, Agentic AI, Microservices architecture
Skills:
.NET, Node.js, Microservices, React Js, Docker, Azure, Python, Kubernetes, AWS, Event-driven architecture, Security best practices, Next.js
Skills:
.NET, Unit Testing, Angular, React, Git, Databricks, Python, ETL ELT processes, service mesh, data analytics platforms, Streaming Data, SDLC best practices, event-driven architectures, distributed tracing, API SDK design, Synapse, observability integrations, AI ML concepts, microservices architecture, load testing tools, CI CD pipelines
We don’t charge any money for job offers