Menu

Find a Job Create an Alert My Saved Jobs Companies Browse Courses Career Advice Employers and Recruiters
Careers24 logo
EMPLOYERS & RECRUITERS | POST A JOB

Menu

Log In Sign Up

Share This Vacancy

Send an email about this job to a friend or to yourself.

  • Home
  • /
  • Job Search
  • /
  • South Africa
  • /
  • Gauteng
Back to Search Results

Data Engineer (Hadoop ecosystem)

  • Johannesburg
  • Salary: Market Related
  • Job Type: Contract
  • Sectors: IT Investment
  • Reference: 154928

Apply before Dec 20 2025 | 12 Days left

Apply

Vacancy Details

Employer: Praesignis (Pty) Ltd

We are seeking a skilled Data Engineer to design and develop scalable data pipelines that ingest raw, unstructured JSON data from source systems and transform it into clean, structured datasets within the Hadoop-based data platform. The ideal candidate will play a critical role in enabling data availability, quality, and usability by engineering the movement of data from the Raw Layer to the Published and Functional Layers.


Key Responsibilities:

  • Design, build, and maintain robust data pipelines to ingest raw JSON data from source systems into the Hadoop Distributed File System (HDFS).
  • Transform and enrich unstructured data into structured formats (e.g., Parquet, ORC) for the Published Layer using tools like PySpark, Hive, or Spark SQL.
  • Develop workflows to further process and organize data into Functional Layers optimized for business reporting and analytics.
  • Implement data validation, cleansing, schema enforcement, and deduplication as part of the transformation process.
  • Collaborate with Data Analysts, BI Developers and Business Users to understand data requirements and ensure datasets are production-ready.
  • Optimize ETL/ELT processes for performance and reliability in a large-scale distributed environment.
  • Maintain metadata, lineage and documentation for transparency and governance.
  • Monitor pipeline performance and implement error handling and alerting mechanisms.


Technical Skills & Experience:

  • 3+ years of experience in data engineering or ETL development within a big data environment.
  • Strong experience with Hadoop ecosystem tools: HDFS, Hive, Spark, YARN and Sqoop.
  • Proficiency in PySpark, Spark SQL, and HQL (Hive Query Language).
  • Experience working with unstructured JSON data and transforming it into structured formats.
  • Solid understanding of data lake architectures: Raw, Published, and Functional layers.
  • Familiarity with workflow orchestration tools like Airflow, Oozie, or NiFi.
  • Experience with schema design, data modeling, and partitioning strategies.
  • Comfortable with version control tools (e.g., Git) and CI/CD processes.


Nice to Have:

  • Experience with data cataloging and governance tools (e.g., Apache Atlas, Alation).
  • Exposure to cloud-based Hadoop platforms like AWS EMR, Azure HDInsight, or GCP Dataproc.
  • Experience with containerization (e.g., Docker) and/or Kubernetes for pipeline deployment.
  • Familiarity with data quality frameworks (e.g., Deequ, Great Expectations).

 


Apply

3 people have applied for this job. 525 people have viewed this job.

Previous
Next

More Jobs at Praesignis (Pty) Ltd

Senior Data Analyst

  • Johannesburg
  • Job Type: Contract
  • Posted 30 Oct 2025 | 21 Days left

Senior Software Engineer

  • Johannesburg
  • Job Type: Contract
  • Posted 30 Oct 2025 | 25 Days left

Infrastructure Manager – IT & Electrical Systems

  • Johannesburg
  • Job Type: Permanent
  • Posted 28 Oct 2025 | 20 Days left

About Careers24.com

  • Careers24 is a leading South African job portal that assists jobseekers from all sectors and experience levels to find and apply for vacancies from hundreds of South Africa’s leading companies. With over a million visitors a month, we are one of the most popular destinations to find employment online in South Africa.

    Our mission is to become the destination where every candidate can find an ideal job and where every company can find the perfect candidate. We are a proud member of the Media24 family and part of the Naspers group of companies.

Company

  • Work For Us

  • Advertise With Us

Quick Links

  • Recruiters Directory

  • Site Map

  • Site Terms and Privacy Policy

  • Widgets

  • Recruiter Terms

  • Faq

Partner Sites

  • Simplify.hr

  • News24

  • Property24

  • Media24

Follow Us


© Careers24. All rights reserved.

About Careers24.com

  • Careers24 is a leading South African job portal that assists jobseekers from all sectors and experience levels to find and apply for vacancies from hundreds of South Africa’s leading companies. With over a million visitors a month, we are one of the most popular destinations to find employment online in South Africa.

    Our mission is to become the destination where every candidate can find an ideal job and where every company can find the perfect candidate. We are a proud member of the Media24 family and part of the Naspers group of companies.

Company

  • Work For Us

  • Advertise With Us

Quick Links

  • Recruiters Directory

  • Site Map

  • Site Terms and Privacy Policy

  • Widgets

Partner Sites

  • Simplify.hr

  • OLX

  • News24

  • Property24

  • Media24


© Careers24. All rights reserved.