Big Data Engineer
Recruiter
Listed on
Location
Salary/Rate
Salary Notes
Type
Start Date
This job has now expired please search on the home page to find live IT Jobs.
Job Summary:
Senior software engineer with experience designing and developing data integrations solutions including big data architecture and analytic solutions. Skillset should focus on data integration & migration utilizing Talend, Scala & Spark for developing and migrating big-data on AWS cloud platform. Candidate must be willing to work collaboratively in a team environment and manage multiple assignments. Must actively support and contribute to the Agile practices adopted by the development team. Must have a proven history of building data pipelines carrying high volume data.
Key Responsibilities:
-Implementing data ingestion/presentation/semantics data layers using Talend as guided by BA/Data Governance.
-Uses the data ingestion/data transformation frameworks developed by the consultants and where necessary assist in the development of new frameworks /or framework variants.
-Has a domain understanding of both the data and the use cases and therefore is responsible for creating common profiles for data elements to facilitate discovery and reusability.
-Data discovery processes
-Skills in data modelling (both structured and unstructured) data
-Skills in metadata repositories (Talend/Distro)
--Skills in data acquisition (ingestion and metadata)
-Skills in data manipulation: Talend, Java, Drill, Scala, Python
-Experience with Hadoop / Hive.
Essential Skills:
-Good experience with Talend Big Data and Talend Data Integration
-Good implementation experience in AWS cloud platforms.
-Development of Talend data intake, cleansing and transformation process in a Cloudera Data Lake framework.
-Batch and streamed data flows in and out of Cloudera.
-Co-Design of comprehensive Tech Metadata management framework under Cloudera
-Integration of In Memory Data Grid solutions.
-Integration with Salesforce
-Talend Job optimization and integration into Oozie.
-Good DBMS knowledge : Redshift, Oracle, MS SQL, MySQL, Postgresql,
-Data discovery processes
-Skills in data modelling (both structured and unstructured) data
-Skills in metadata repositories (Talend/Distro)
-Skills in data acquisition (ingestion and metadata)
-Skills in data manipulation: Talend
-Experience with Hadoop / Hive.
-Experience building and operating scalable infrastructure software or distributed systems
-Demonstrable track record as an owner: someone who can take a concept and make it real.
-A thirst for knowledge and continuous improvement.
The following 3 Talend certifications:
-Talend Data Integration Basics
-Talend Data Integration Advanced
-Talend Big Data Basics
Nice to Have Skills:
-Experience in Agile/SCRUM enterprise-scale software development
-Familiar with building secure software using modern security principles
-Demonstrated ability to achieve stretch goals in a highly innovative and fast paced environment
-Data management and analytics
-Generation of data catalogues/models to user communities (e.g. ERWIN)
-Knowledge of Cloudera platform: HDFS, Hbase, Hive, Impala.
-Scala, Python / Spark Programming
-Skills in data manipulation: Lambda, Solr etc.
-Experience with Spark environment
Certification in below Talend modules:
-Talend Big Data Advanced - Spark
-Talend Data Mapper Essentials
-Talend Data Integration Admin
Cognizant US Corporation is an Equal Opportunity Employer Minority/Female/Disability/Veteran. If you require accessibility assistance applying for open positions in the US, please send an email with your request.
To apply for this role please click the APPLY button.