NOTE:
· NEED MINIMUM 8+ YEARS OF IT EXP.
· Working with Prime Vendor.
· NO OPT/CPT
Title: Data Scientist
Location: Raleigh, NC (Remote till Pandemic)
Duration: Long-term
Client domain: Banking/Finance
Required Skills:
• Have worked with Python for lots of different types of problems, especially using the powerful analysis stack (numpy, pandas, scikit-learn)
• Have solid understanding of both classical and modern database technologies (SQL and noSQL / document based)
• Are comfortable in a Linux environment, using Git (or any version control) and are into automating everything
• Have used machine learning packages and libraries and understand fundamental techniques to building models
• Have had practical experience with streaming data and have worked with Apache Kafka
• Have worked with large unstructured, messy datasets, especially raw text.
• Must have experience using Natural Language Processing (NLP) techniques and tools
• Are interested in working with a diverse stack of tools and learning new technologies as different problems arise
Preferred Skills:
• Proficiency in modern, efficient Python development (asynchronous generators and asyncio are in your Python tool belt?)
• Have worked with graph databases (e.g. Neo4j) and/or search indices (Solr, Elastic)
• Have worked with Kafka Streams or Faust
• Have worked with larger datasets with distributed computing tools (e.g. Apache Spark)
• Proficient reading and understanding enterprise-grade Java code (Java development background a huge plus)
• Experience presenting data in custom dashboards based on NodeJS / React / JavaScript a plus
• Practical experience with Snowflake and/or public cloud services (Azure preferred)
• Working understanding of Container orchestration platforms (OpenShift / Kubernetes) a plus.
Thanks,
Sneh Purohit
Direct: 908-666-0623