Hello Professionals,
I am Vivek Shukla and I work with Shift Code. We are currently working with one of our clients to fill a position. Please review the job description below and let me know if you have any candidates.
Data Engineer
Remote
Interview: Phone and Skype
Visa: USC, GC, GC EAD, H4, L2
LinkedIn profile: Required
Candidates must have 9+ years of experience and 4+ years as a Sr. Engineer.
Description:
Must-have skills:
AWS
Kafka streaming experience
Python
Must have worked with at least a terabyte of data.
Responsibilities
What you'll do:
You will be working with your team, peers, partners, cross-functional teams and vendors to:
Migrate and re-engineer data products from on-prem to cloud.
Gather and process all types of data including raw, structured, semi-structured, and unstructured data.
Code in Java, Scala, or Python.
Build and deploy data pipelines and database processes, including SQL and NoSQL databases, for enterprise data management applications.
Analyse requirements and technical specifications, and implement assigned development tasks using various technologies.
Mentor and lead junior data engineers by providing technical guidance and oversight.
Provide ongoing support, monitoring, and maintenance of deployed products.
Respond to data and product related inquiries in real-time to support business and technical teams.
Drive and maintain a culture of quality, innovation and experimentation.
Functional areas: Master Data Management, 2nd & 3rd Party Data Management, Data Quality, Data Controls.
Qualifications
What you'll need:
- BS/MS or above in computer science or data science programs.
- 7+ years of professional experience, preferably in Development, R&D, or Information Technology.
- Highly proficient in at least one of the following programming languages: Scala, Python, or Java.
- Highly proficient in writing shell scripts.
- Experience with data warehouse technologies: MapReduce, HDFS, Hive, Tez, Spark, Sqoop
- Experience with streaming technologies - Kafka, Kafka Connect, KStreams, KSQL, Beam, Flink, Spark
- Experience developing for Linux-based deployment platforms, developing scalable, multithreaded server-side software for deployment
- Experience with cloud computing - Google Cloud Platform, Amazon Web Services
- Experience with API design/development – RPC, REST, JSON
- Experience with CI/CD, build and deployment technologies such as Jenkins
- Experience with Data Visualization or Data Notebook tools (e.g., Zeppelin, Tableau)
- Experience developing SQL applications of significant complexity
- Experience with MDM and Data Catalog will be an added advantage.
- Familiarity with BI reporting tools (QlikView, Tableau)
- Experience with machine learning processes will be a plus.
- Core understanding and proficiency in:
  - Multiple programming languages such as Scala and/or Python
  - Cloud platform offerings and capabilities, especially AWS: S3, Aurora, Redshift, Lambda, Glue
  - SQL-based data storage systems such as PostgreSQL, Teradata
- Some experience with:
  - Big Data OLAP datastores such as SingleStore
  - Jupyter Notebook
  - Any ETL tool (Talend will be a plus)
- Any experience with the following is a bonus:
  - NoSQL-based storage systems such as Neo4j
  - Containerization technologies such as Docker, Kubernetes
  - Machine Learning algorithms
  - Data Catalogs for technical metadata
Vivek Shukla
Technical Recruiter
ShiftCode Analytics Inc.,
5118 Sylvester Loop,
Tampa, Florida 33610
Direct: 510-955-4703
E-mail: vshukla@shiftcodeanalytics.com
URL: http://www.shiftcodeanalytics.com/