Mumbai, Pune
Technologies / Skills:
Advanced SQL, Python and associated libraries such as Pandas and NumPy, PySpark, shell scripting, data modelling, big data, Hadoop, Hive, ETL pipelines, and IaC tools such as Terraform.
Responsibilities:
• Communicate effectively to coordinate with users, technical teams and Data/Solution Architects.
• Write technical design documents for given requirements or JIRA stories.
• Communicate results and business impacts of insight initiatives to key stakeholders to collaboratively solve business problems.
• Work closely with the overall Enterprise Data & Analytics Architect and Engineering practice leads to ensure adherence to best practices and design principles.
• Ensure quality, security and compliance requirements are met for the supported area.
• Develop fault-tolerant data pipelines running on a cluster.
• Design scalable and modular solutions.
Required Qualification:
• 1-8 years of hands-on experience developing data pipelines for data ingestion or transformation using Python (PySpark) / Spark SQL in the AWS cloud
• Experience in development of data pipelines and processing of data at scale using technologies like EMR, Lambda, Glue, Athena, Redshift, Step Functions.
• Advanced experience in writing and optimizing efficient SQL queries with Python and Hive, handling large data sets in big data environments
• Experience in debugging, tuning and optimizing PySpark data pipelines
• Hands-on implementation experience and good knowledge of PySpark DataFrames, joins, partitioning, parallelism, etc.
• Understanding of the Spark UI, event timelines, DAGs and Spark configuration parameters in order to tune long-running data pipelines
• Experience working in Agile implementations
• Experience with Git and CI/CD pipelines to deploy cloud applications
• Good knowledge of designing Hive tables with partitioning for performance
Thanks and Regards
HR TEAM
Experience | 1 - 7 Years |
Salary | 80 Thousand To 12 Lac P.A. |
Industry | IT Software - Application Programming / Maintenance |
Qualification | Other Bachelor Degree |
Key Skills | ETL Hadoop Python AWS Spark Data Engineer Walk in |