Search by job, company or skills

F

spark,python(pyspark developer)

2-6 Years
Save
new job description bg glownew job description bg glow
  • Posted a month ago
  • Over 100 applicants
Quick Apply

Job Description

The developer must have sound knowledge in Apache Spark and Python programming.

Deep experience in developing data processing tasks using Pyspark such as reading data from external sources, merge data, perform data enrichment and load in to target data destinations.

Create Spark jobs for data transformation and aggregation Produce unit tests for Spark transformations and helper methods.

Design data processing pipelines to perform batch and Realtime/stream analytics on structured and unstructured data.

Spark query tuning and performance optimization Good understanding of different file formats (ORC, Parquet, AVRO) to optimize queries/processing and compression techniques.

SQL database integration.

Hands on expertise in cloud services like AWS.

Mandatory skills

Spark, Python

Desired skills

Spark, Python.

More Info

Job Type:
Industry:
Function:
Employment Type:
Open to candidates from:
Indian

About Company

At Fusion Plus Solutions Inc, we believe that it’s an exceptional company - a company of people proud of the work they do and the solutions they provide. By understanding what drives our specialty industries, becoming involved in our communities on a professional and personal basis, following a disciplined process of identifying quality candidates, partnering with employers to understand their core business and their employment requirements, and delivering exceptional service, we achieve great results for all concerned.

Job ID: 121829065

Similar Jobs

Hyderabad, India

Skills:

PysparkSqlJavaPythonMavenScalaJenkinsGitdbtAWS Ecosystem

Hyderabad, India

Skills:

snowflake S3PysparkApache SparkRedshiftSqlLambdaSparkData LakePythonAWSEtlCopilotClaudeAI coding assistantsGlue

Hyderabad

Skills:

SparkAws CloudPythonTerraformJenkinsHadoop

Hyderabad, India

Skills:

HadoopData ModelingSparkBig DataKafkaData WarehousingDatabase DesignData PipelinesReal-Time Data Processing

Remote

Skills:

data engineering Apache SparkDistributed ComputingBig Data ProcessingCloud PlatformsScalable Code Development