Data Engineer
Outbrain
(New York, New York)Outbrain is the world’s largest content discovery platform, bringing personalised, relevant online, mobile and video content to audiences while helping publishers understand their audiences through data. Outbrain serves over 190 billion personalised content recommendations every month and reaches over 561 million unique visitors from across the globe.
You must be a self-starter with strong design and coding skills who loves shipping great product. You will be responsible for the design and development of the machine learning products and data systems that provide rich insight into web traffic performance using Visual Revenue's unique datasets. You will challenge yourself and others to constantly come up with better solutions behind the ultimate real-time analytics platform for news editors.
- Design, build, and deploy high-performance production platforms/infrastructure to support data warehousing, real-time ETL, and batch big-data - processing
- Detect data quality issues all the way down to root cause, and implement fixes and data audits to prevent/capture such issues
- Create compelling PoC’s for data solutions using emerging technologies for real-time and big data ingestion and processing
- Optimize code and design architecture to streamline data science processes
- Build, automate and deploy specialized research environments
Required Qualifications:
- 3+ years of experience in software engineering
- Strong programming experience with any of: Python, Scala or Java
- In-depth understanding of object oriented programming concepts
- Experience with batch and real-time data processing frameworks like Storm, Kafka, or Spark
- Effective knowledge of SQL, especially Hive, and NoSQL databases
- Expertise with version control systems, especially with Git - Proficiency with Unix/Linux environment
Preferred Qualifications:
- Experience with data mining, machine learning, and underlying algorithms
- Experience with statistical methods and experimentation (A/B testing)
Questions
There are no answered questions, sign up or login to ask a question
- Algorithms
- Architecture
- Big Data
- Data Mining
- Data Processing
- Data Science
- Databases
- Infrastructure
- Java
- Linux
- Python
- Scala
- SQL
- Unix
- A/B Testing
- Data Ingestion
- Data Warehousing
- Git
- Machine Learning
- NoSQL
- Version Control
- Object Oriented Programming
- Data Solutions
- ETL
- Storm
- Architectural Design
- engineering
- Data Quality
- Software

Want to see jobs that are matched to you?
DreamHire recommends you jobs that fit your
skills, experiences, career goals, and more.