
GCP Data Engineer :: Sunnyvale, CA (Onsite)

Job Title: GCP Data Engineer

Location: Sunnyvale, CA (Onsite)

Position Type: Long-term contract

 

Mandatory skills: Spark, Scala, GCP, Airflow (including DAG design), ETL, PySpark

As a Senior Data Engineer, you will:
• Design and develop big data applications using the latest open-source technologies.
• Work in an offshore delivery model with managed outcomes (desired).
• Develop logical and physical data models for big data platforms.
• Automate workflows using Apache Airflow (see the DAG sketch after this list).
• Create data pipelines using Apache Hive, Apache Spark, and Apache Kafka.
• Provide ongoing maintenance and enhancements to existing systems and participate in rotational on-call support.
• Learn our business domain and technology infrastructure quickly, and share your knowledge freely and actively with others on the team.
• Mentor junior engineers on the team.
• Lead daily standups and design reviews.
• Groom and prioritize the backlog using JIRA.
• Act as the point of contact for your assigned business domain.
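A minimal sketch of the kind of Airflow workflow automation described above, assuming Airflow 2.x; the DAG name, schedule, owner, and spark-submit path are illustrative assumptions, not details from this posting.

```python
# Minimal Airflow 2.x DAG sketch; dag_id, schedule, and the job
# script path are illustrative assumptions only.
from datetime import datetime, timedelta

from airflow import DAG
from airflow.operators.bash import BashOperator

default_args = {
    "owner": "data-engineering",        # hypothetical owner
    "retries": 2,
    "retry_delay": timedelta(minutes=5),
}

with DAG(
    dag_id="daily_events_etl",          # hypothetical pipeline name
    default_args=default_args,
    start_date=datetime(2024, 1, 1),
    schedule_interval="@daily",
    catchup=False,
) as dag:
    # Submit a Spark batch job; the path is a placeholder.
    run_spark_etl = BashOperator(
        task_id="run_spark_etl",
        bash_command="spark-submit /opt/jobs/events_etl.py",
    )
```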

Requirements:
GCP Experience

• 4+ years of recent GCP experience
• Experience building data pipelines in GCP
• Experience with GCP Dataproc, GCS, and BigQuery


• 12+ years of hands-on experience developing data warehouse solutions and data products.
• 6+ years of hands-on experience developing distributed data processing platforms with Hadoop, Hive or Spark, and Airflow or another workflow orchestration solution.
• 5+ years of hands-on experience modeling and designing schemas for data lakes or RDBMS platforms.

Design, develop, and automate data processing workflows using Airflow, PySpark, and Dataproc on GCP.
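As a hedged sketch of how those pieces fit together, the Google provider package for Airflow offers DataprocSubmitJobOperator; the project, region, cluster name, and GCS path below are placeholders, not details from this posting.

```python
# Sketch: orchestrating a PySpark job on Dataproc from Airflow.
# Requires apache-airflow-providers-google; all IDs and paths are placeholders.
from datetime import datetime

from airflow import DAG
from airflow.providers.google.cloud.operators.dataproc import (
    DataprocSubmitJobOperator,
)

PYSPARK_JOB = {
    "reference": {"project_id": "my-project"},          # placeholder
    "placement": {"cluster_name": "etl-cluster"},       # placeholder
    "pyspark_job": {"main_python_file_uri": "gs://my-bucket/jobs/events_etl.py"},
}

with DAG(
    dag_id="dataproc_pyspark_etl",                      # hypothetical
    start_date=datetime(2024, 1, 1),
    schedule_interval="@daily",
    catchup=False,
) as dag:
    submit_pyspark = DataprocSubmitJobOperator(
        task_id="submit_pyspark",
        job=PYSPARK_JOB,
        region="us-central1",                           # placeholder
        project_id="my-project",                        # placeholder
    )
```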

Develop ETL (Extract, Transform, Load) processes that handle diverse data sources and formats.
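For illustration, a small PySpark ETL sketch over two source formats; the bucket paths and column names are assumptions.

```python
# PySpark ETL sketch: extract CSV and JSON, transform, load Parquet.
# All paths and column names are illustrative assumptions.
from pyspark.sql import SparkSession
from pyspark.sql import functions as F

spark = SparkSession.builder.appName("diverse_sources_etl").getOrCreate()

# Extract: two different source formats.
orders = spark.read.option("header", "true").csv("gs://my-bucket/raw/orders/")
events = spark.read.json("gs://my-bucket/raw/events/")

# Transform: normalize types, join on a shared key, derive a partition column.
orders = orders.withColumn("order_ts", F.to_timestamp("order_ts"))
joined = orders.join(events, on="order_id", how="left")
joined = joined.withColumn("order_date", F.to_date("order_ts"))

# Load: write partitioned Parquet for downstream consumers.
joined.write.mode("overwrite").partitionBy("order_date").parquet(
    "gs://my-bucket/curated/orders_enriched/"
)
```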

Manage and provision GCP resources including Dataproc clusters, serverless batches, Vertex AI instances, GCS buckets, and custom images.
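On the provisioning side, a sketch with the google-cloud-dataproc client library; the project, region, cluster name, and machine sizing are placeholders.

```python
# Sketch: creating a Dataproc cluster with google-cloud-dataproc.
# Project, region, and sizing values are placeholders.
from google.cloud import dataproc_v1

project_id = "my-project"   # placeholder
region = "us-central1"      # placeholder

client = dataproc_v1.ClusterControllerClient(
    client_options={"api_endpoint": f"{region}-dataproc.googleapis.com:443"}
)

cluster = {
    "project_id": project_id,
    "cluster_name": "etl-cluster",  # placeholder
    "config": {
        "master_config": {"num_instances": 1, "machine_type_uri": "n1-standard-4"},
        "worker_config": {"num_instances": 2, "machine_type_uri": "n1-standard-4"},
    },
}

operation = client.create_cluster(
    request={"project_id": project_id, "region": region, "cluster": cluster}
)
result = operation.result()     # blocks until the cluster is ready
print(f"Cluster created: {result.cluster_name}")
```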

Provide platform and pipeline support to analytics and product teams, troubleshooting issues related to Spark, BigQuery, Airflow DAGs, and serverless workflows.
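Support work of this kind often starts with an ad-hoc freshness check; a brief sketch with the google-cloud-bigquery client, where the table and timestamp column are hypothetical.

```python
# Sketch: quick BigQuery sanity check for a downstream team.
# The table and ingest_ts column are hypothetical.
from google.cloud import bigquery

client = bigquery.Client()  # uses application-default credentials

query = """
    SELECT MAX(ingest_ts) AS latest_ingest, COUNT(*) AS row_count
    FROM `my-project.analytics.orders_enriched`
"""
for row in client.query(query).result():
    print(f"latest_ingest={row.latest_ingest}, rows={row.row_count}")
```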
• Experience with programming languages: Python, Java, Scala, etc.
• Experience with scripting languages: Perl, Shell, etc.
• Practical experience working with, processing, and managing large data sets (multi-TB/PB scale).
• Exposure to test-driven development and automated testing frameworks.
• Background in Scrum/Agile development methodologies.
• Capable of delivering on multiple competing priorities with little supervision.
• Excellent verbal and written communication skills.
• Bachelor's degree in Computer Science or equivalent experience.
The most successful candidates will also have experience in the following:
• Gitflow
• Atlassian products – Bitbucket, JIRA, Confluence etc.
• Continuous Integration tools such as Bamboo, Jenkins, or TFS

Thanks

Gigagiglet
gigagiglet.blogspot.com