hi
please share profiles for below JD except CPT
Title: Data Engineer Location: Dublin, OH & Dallas, TX Looking for an individual who has specifically above skills as mentioned. Job Description: 2-3 years of experience required - Design and architect high quality data-lake, data-warehouse, and data-marts data models
- Build and maintain data pipelines orchestration
- Build process for extraction, transformation, and loading of data from wide varieties of data sources (e.g. APIs, CSV, Excel, Other databases, etc.) using Python and Big Data technologies
- Build and optimize performance of Pyspark, Hive, Spark, Kafka etc. for real-time pipelines
- good hands on HIve, hadoop, big query, GCP, Azure databricks.
- Collaborate with business team to migrate legacy applications to modern architectures
- Develop Data Quality checks for source and target data sets. Develop UAT plans and conduct QA
- Simulate, load-test, and performance analyze a complex distributed system
- Develop and cultivate expertise in current and new technologies and tools
- Create automated self-service procedures that allow the platform to scale across multiple agile projects with minimal overhead
- Translate complex functional and technical requirements into detailed design
- Languages : Java, XML, Python, SQL/HQL, Numpy
- Database : MySQL, Oracle, Snowflake, Teradata, SAP, Hadoop
- GUI : HUE, JIRA
- Tools : Docker, Jenkins, GitHub, Maven, Putty, HIVE,
O/s : Windows 7, Linux, Unix |
Comments
Post a Comment
Thanks