Hi;
Hope you are doing great
Please go through the JD and revert me your comfortness :-
Position:-Data Engineer (Pyspark, AWS)
Location : NJ
Data engineer needed with Pyspark and AWS and max rate is $60/hrs on c2c
Major Duties & Responsibilities
Data engineering platform support
- Understand data and functional requirements to enable and support data engineering pipelines.
- Act as a domain specialist in Data Engineering technologies and bring outstanding, innovative ideas to develop, test, and measure performance and impact of initiatives.
- provide recommendations on performance improvement.
- Collaborate with ML factory model support partners to use ML-DevOps capabilities to operationalize ML models.
- Develop processes and tools to monitor and analyze model performance and data accuracy
Drive the design and build of new data pipelines & feature engineering layer
- Apply data modeling, data engineering and feature engineering principles to support data science requirements and supply raw, curated, and processed data for machine learning engineers and data scientists.
- Work in multi-functional agile teams to continuously experiment, iterate, and deliver business goals and objectives.
- Collaborate with other data engineers, ML specialists, and partners from multiple therapeutic areas to take findings and alignments as they arise.
- Lead the development and implementation of data engineering and feature engineering pipelines for predictive models and model tracking.
Lead the design and implementation of data engineering platform
- Collaborate with data engineers and data scientists to build scalable data engineering and data science solutions improving the AWS platform, PySpark, Python, and Dataiku.
- Assist in developing architectural models for cloud-based data engineering solutions improving AWS technologies to support large scale data science platforms.
Qualifications
Required Knowledge, Skills and Abilities:
- Proficiency in building data engineering pipelines in Cloud to support ML projects
- Familiarity in data modeling, data access, and data storage techniques in the Cloud environment.
- Proficiency in Python, Pyspark, SPARK, EMR, EC2, RedShift or similar technologies
- Deep understanding of collaborative data science platform like Dataiku will be preferred
- Strong background and experience in the healthcare/ life sciences field will be highly beneficial
- Good understanding of machine learning algorithms in life sciences Sales & Marketing space will be preferred.
- Familiarity with how data scientists work and how DS/ML solutions can and should scale in production.
- Good track record of translating business requirements into technical designs for new technology solutions.
- Demonstrated good leadership capabilities through technology solution ownership and adoption.
- Understand the value of collaboration within teams, are excellent communicators, and build relationships with a diverse set of internal and external partners.
- Demonstrated technical innovation and experimentation of the emergent solutions in alignment with project roadmap.
Vaibhav Kumar | VBeyond Corporation
Hangout:-vaibhavvbeyond@gmail.com
Note: VBeyond is fully committed to Diversity and Equal Employment Opportunity.
Disclaimer: We respect your Online Privacy. This is not an unsolicited mail. Under Bill S 1618 Title III passed by the 105th US Congress this mail cannot be considered Spam as long as we include Contact information and a method to be removed from our mailing list. If you are not interested in receiving our e-mails then please reply to vaibhavk@vbeyond.com subject=Remove. Also mention all the e-mail addresses to be removed which might be diverting the e-mails to you. We are sorry for the inconvenience
Comments
Post a Comment
Thanks