Senior Data Pipeline Engineer
Radius
(San Francisco, California)
Radius is a fast-growing, venture-backed startup in the heart of San Francisco. Radius applies advanced data science to deliver the freshest, most accurate, and most comprehensive view of 20M+ US companies, from small businesses to the largest enterprises. We build cutting-edge machine learning solutions that help our customers discover markets, acquire customers, and measure performance through an app that’s intuitive, secure, and enterprise-ready.
We're looking for an experienced Data Software Engineer who enjoys developing highly scalable and robust code to process large data sets. You’ll be working with a team with diverse expertise in designing and deploying distributed systems for large-scale data processing. Your team will be responsible for the validation, standardization and featurization of the source data used to build Radius’s Business Graph.
Responsibilities:
- Helping lead and mentor the team’s data engineers and scientists to maximize their potential
- Architecting and implementing a pipeline to validate, standardize and featurize millions of records on a daily basis
- Working with the team’s data scientists to turn prototypes into production-ready Scala code
- Guiding the team members and codebase in data engineering and Scala best practices
Requirements:
- 5+ years of experience in computer engineering and/or full-lifecycle software engineering, including coding, testing, troubleshooting and deployment
- 2+ years in data engineering or cluster computing
- 6+ months of production experience in Scala or Spark
- Comfortable working in remote Linux and cloud (EC2) environments
Additional Preferred Qualifications:
- Experience with Hadoop
- Experience spearheading efforts around data quality and improvement
- Experience working with data scientists or on data science problems
Skills:
- Building Codes
- Cloud
- Data Science
- Developing and Testing Prototypes
- Hadoop
- Linux
- Scala
- Testing
- Troubleshooting
- Amazon EC2
- Apache Spark
- Cluster Computing
- Computer Engineering
- Data Engineering
- Software Engineering
- Deployment
- Distributed Systems
- Codebase
- Streaming Data Pipeline
- Data Quality