Job Description
Cloudera Data Engineer
We have an exciting opening for a full-time Cloudera Data Engineer to join our innovative Software Development team in Madison, Wisconsin!
Join a team recognized as one of Madison Magazine's Best Places to Work, where innovation thrives, collaboration drives success, and your work makes a real-world impact. At Yahara, we don't just build data pipelines; we empower people and transform industries.
Important Notes about this Position:
* This position offers remote work flexibility but is only open to candidates who reside in or are willing to relocate to the greater Madison, WI area.
* We are unable to provide sponsorship at this time.
Summary
The Cloudera Data Engineer designs and maintains enterprise-scale data pipelines using the Cloudera Data Platform (CDP) and related big data technologies. The role focuses on building scalable ETL/ELT workflows, optimizing distributed compute, and enabling secure, high-performance data services across multiple domains. Work is highly collaborative within cross-functional Agile teams.
Our Approach
We build data solutions grounded in strong engineering fundamentals: reliable architecture, quality controls, and scalable design. We use modern cloud platforms, integrating analytics and ML where valuable while prioritizing data integrity and governance.
What You'll Do
- Design and maintain enterprise-scale pipelines using CDP and big data tooling.
- Build scalable ETL/ELT workflows for structured and unstructured data.
- Develop distributed processing jobs using big data framework components.
- Design data storage solutions balancing performance and cost.
- Collaborate with analysts, scientists, and developers to deliver data solutions.
- Develop technical documentation for pipelines and architectures.
What You'll Bring
Experience & Education:
- 5-7 years in data engineering with big data or distributed systems.
- Experience with CDP, CDH, or similar enterprise big data platforms.
- Degree in CS, Data Science, Information Systems, or equivalent experience.
- Strong background in distributed data processing.
- Ability to obtain and maintain Public Trust clearance.
Mindset & Approach:
- Self-starter with a passion for data engineering.
- Strong analytical and problem-solving skills.
- Enthusiastic about big data technologies and performance optimization.
- Detail-oriented with a commitment to accuracy and reliability.
- Ability to translate business requirements into effective solutions.
- Collaborative, able to recognize blockers and leverage team strengths.
Technical Background:
- Experience with Agile development environments.
- Proven experience designing and implementing production pipelines.
- Experience in biohealth, laboratory, or scientific data environments is a plus.
- Familiarity with HIPAA, FDA, or GxP preferred but not required.
Specific Technical Qualifications
- Cloudera ecosystem experience: CDP, HDFS, Hive/Impala, Spark.
- Programming: Python, Scala, or Java.
- Advanced SQL and distributed compute (Spark, MapReduce).
- Shell scripting and version control (Git).
- Data storage formats: Parquet, Avro, ORC.
- Workflow orchestration and scheduling.
- Cloud experience (Azure, AWS, or GCP) and understanding of hybrid patterns.
Company Benefits & Perks
- 20+ days of PTO accruable in the first year!
- Comprehensive health insurance (Medical, Dental, Vision) with HMO and PPO options
- Health Savings Account (HSA) with annual employer contributions
- 401(k) with guaranteed company match (Traditional and Roth options)
- 100% company-paid short-term and long-term disability
- 100% company-paid life insurance with option to increase coverage
- 100% company-paid identity theft protection
- On-site gym with basketball court
- Hybrid/remote schedule with home office stipend
- Fresh fruit, healthy snacks, and beverages provided daily
- Bonus certification program (Microsoft, AWS, PMP, IIBA, etc.)
- Employee Assistance Program (counseling, legal, financial services)
- Monthly and Quarterly Recognition Awards with spot bonuses
- Company-supported community outreach and volunteer opportunities
- Employee-run committee involvement opportunities
- Collaborative culture founded on realized values and incredible people
If you need an accommodation as part of the employment process, please contact Human Resources via email at hradmin@yaharasoftware.com.
Yahara Software LLC is an Equal Employment Opportunity/Affirmative Action Employer.
This is a full-time, salaried position with competitive salary and benefits. Candidates must be eligible to work in the U.S. on a permanent basis and able to work on-site in our office located in Madison, Wisconsin.
Job Tags
Permanent employment, Full time, Temporary work, Work at office, Relocation, Home office