
Job Information
IBM Data Engineer-Data Platforms in Pune, India
Introduction
In this role, you'll work in one of our IBM Consulting Client Innovation Centers (Delivery Centers), where we deliver deep technical and industry expertise to a wide range of public and private sector clients around the world. Our delivery centers offer our clients locally based skills and technical expertise to drive innovation and adoption of new technology.
Your role and responsibilities
Design, build, optimize and support new and existing data models and ETL processes based on our clients' business requirements.
Build, deploy and manage data infrastructure that can adequately handle the needs of a rapidly growing, data-driven organization.
Coordinate data access and security to enable data scientists and analysts to easily access data whenever they need to.
Required technical and professional expertise
Must have 3-5 years of experience in Big Data: Hadoop, Spark, Scala, Python, HBase, Hive.
Good to have: AWS (S3, Athena, DynamoDB, Lambda), Jenkins, Git.
Experience developing Python and PySpark programs for data analysis. Good working experience with Python to develop a custom framework for generating rules (similar to a rules engine).
Experience developing Python code to gather data from HBase and designing solutions implemented in PySpark, using Apache Spark DataFrames/RDDs to apply business transformations and HiveContext objects to perform read/write operations.
Preferred technical and professional experience
Understanding of DevOps.
Experience in building scalable end-to-end data ingestion and processing solutions
Experience with object-oriented and/or functional programming languages, such as Python, Java and Scala