- Design, implement, and optimize search services using technologies such as Elasticsearch, OpenSearch, Solr, or vector search frameworks.
- Collaborate with data scientists and analysts to deliver data models and insights.
- Develop custom ranking algorithms, relevancy tuning, and semantic search capabilities tailored to scientific and medical content
- Support the development of intelligent search features like query understanding, question answering, summarization, and entity recognition
- Build and maintain robust, cloud-native APIs and backend services to support high-availability search infrastructure (e.g., AWS, GCP, Azure
- Implement CI/CD pipelines, observability, and monitoring for production-grade search systems
- Work closely with Product Owners, Tech Architect.
- Enable indexing of both structured (e.g., clinical trial metadata) and unstructured (e.g., PDFs, research papers) content
- Design & develop modern data management tools to curate our most important data sets, models and processes, while identifying areas for process automation and further efficiencies
- Expertise in programming languages such as Python, Java, React, typescript, or similar.
- Strong experience with data storage and processing technologies (e.g., Hadoop, Spark, Kafka, Airflow, SQL/NoSQL databases).
- Demonstrate strong initiative and ability to work with minimal supervision or direction
- Strong experience with cloud infrastructure (AWS, Azure, or GCP) and infrastructure as code like Terraform
- In-depth knowledge of relational and columnar SQL databases, including database design
- Expertise in data warehousing concepts (e.g. star schema, entitlement implementations, SQL v/s NoSQL modeling, milestoning, indexing, partitioning)
- Experience in REST and/or GraphQL
- Experience in creating Spark jobs for data transformation and aggregation
- Experience with distributed, multi-tiered systems, algorithms, and relational databases.
- Possesses strong rapid prototyping skills and can quickly translate concepts into working code
- Develop and execute unit tests, integration tests, and other testing strategies to ensure the quality of the software
- Analyze and understand the functional and technical requirements of applications
- Identify and resolve software bugs and performance issues
- Work closely with multi-functional teams, including product management, design, and QA, to deliver high-quality software on time
- Maintain detailed documentation of software designs, code, and development processes
Basic Qualifications:
- Degree in computer science & engineering preferred with 6-8 years of software development experience
- Proficient in Python, Java, React, typescript, Postgres, Databricks
- Hands-on experience with search technologies (Elasticsearch, Solr, OpenSearch, or Lucene).
- Hands on experience with Full Stack software development. Proficient in programming languages, Java, Python, Fast Python, Databricks/RDS, Data engineering, S3Buckets, ETL, Hadoop, Spark, airflow, AWS Lambda
- Experience with data streaming frameworks (Apache Kafka, Flink).
- Experience with cloud platforms (AWS, Azure, Google Cloud) and related services (e.g., S3, Redshift, Big Query, Databricks)
- Hands on experience with various cloud services, understand pros and cons of various cloud services in well architected cloud design principles
- Working knowledge of open-source tools such as AWS lambda.
- Strong problem solving, analytical skills; Ability to learn quickly; Excellent communication and interpersonal skills
Preferred Qualifications:
- Experience in Python, Java, React, Fast Python, Typescript, JavaScript, CSS HTML is desirable
- Experienced with API integration, serverless, microservices architecture.
- Experience in Data bricks, PySpark, Spark, SQL, ETL, Kafka
- Solid understanding of data governance, data security, and data quality best practices
- Experience with Unit Testing, Building and Debugging the Code
- Experienced with AWS /Azure Platform, Building and deploying the code
- Experience in vector database for large language models, Databricks or RDS
- Experience with DevOps CICD build and deployment pipeline
- Experience in Agile software development methodologies
- Experience in End-to-End testing
- Experience in additional Modern Database terminologies.
Good to Have Skills
- Willingness to work on AI Applications
- Experience in NLMs, Solr Search
- Experience with popular large language models
- Experience with LangChain or LlamaIndex framework for language models
- Experience with prompt engineering, model fine tuning
- Knowledge of NLP techniques for text analysis and sentiment analysis