Data Engineer II

Bengaluru, Karnataka, India | Data Science | Full-time | COVID-19 remote

About the company

ShareChat (https://sharechat.com/about) is India’s largest social media and content marketplace platform operating exclusively in Indic languages. We empower more than 300 million monthly active users to share their opinions, record their lives and make new friends, all within the comfort of their language of choice. We lead the social content space in India through two products: Moj, India’s largest short-video platform (160+ million MAUs, 50 million creators), and ShareChat, which operates in 15 Indic languages (180 million MAUs, 32+ million creators).

ML Platform

The ShareChat Machine Learning (ML) Platform engineers build scalable shared components that accelerate ML development and power billions of predictions per day. Our products handle massive scale seamlessly, magnifying the impact and quality of the ML use cases deployed across the company.

We're looking for a passionate data engineer for our Bangalore location (presently working remotely) who has demonstrated the ability to design, build and scale data systems. In this role, you will:

  • Work in tandem with data scientists, product managers and leadership to define essential data needs for building state of the art ML models
  • Build large-scale pipelines that transform batch and real-time data into usable training and inference inputs for machine learning, using tools such as Kafka, Cloud Pub/Sub, Apache Beam/Flink/Spark and Cassandra on Google Cloud Platform
  • Write clean, high-quality code that is testable and maintainable and that scales to handle billions of data points per day
  • Demonstrate expertise in data engineering challenges across the company through data solutions, standardised data sets and dashboards
  • Be a strong role model and technical guide to your team and partners by identifying and fixing missing pieces in ShareChat’s data infrastructure

As you work on these problems, you will use your technical skills to collaborate on and solve some of the most challenging data problems in a fast-moving space.

You should have:

  • 4+ years of industry experience with at least 2 years spent working with large-scale data systems
  • Strong coding skills in Golang, Java or another JVM language. Familiarity with Python is a big plus
  • Hands-on experience with real-time processing tools
  • A mindset that values team success
  • Experience with Google Cloud Platform, Docker and Kubernetes is a big plus