Certified Training   

 

Big Data & ETL Training


Program duration:   

Non-IT professionals: 6 months (336h)

IT professionals: 4 months (224h)

Objectives:

    Master ETL Processes: Gain skills in data ingestion, cleaning, transformation, and loading to structured and unstructured storage.

    Big Data Storage & Processing: Work with Hadoop and Spark for managing and processing large datasets. 

   Data Analytics & Reporting: Utilize BI tools and ML techniques for data analysis and visualization. 


Modules:

ETL & Data Processing (28h)/(42h)   

  • Introduction to Python programming
  • Introduction to relational and NoSQL databases
  • Introduction to ETL
    • Fundamentals of ETL and data warehousing
    • Data quality and validation techniques
    • Batch processing vs Streaming
  • ETL & Data Pipelines
    • SQL-based ETL
    • No SQL data handling (MongoDB, Cassandra)
  • Capstone Project Depending on the Job domain (Finance, GeoLocation, Health, etc.) (28h)/(42h)

✅ Big Data Storage & Processing (28h)/(42h)

  • Hadoop Ecosystem (HDFS, Hive, HBase)
  • Apache Spark for large-scale data
  • Capstone Project Depending on the Job domain (Finance, GeoLocation, Health, etc.) (28h)/(42h)


 ✅  Real-time Data Streaming & Processing (28h)/(42h)

  • Kafka for data ingestion
  • Spark Streaming for real-time analytics
  • Capstone Project Depending on the Job domain (Finance, GeoLocation, Health, etc.) (28h)/(42h)

Machine Learning for Data Analytics  (28h)/(42h)  

  • Introduction to ML in Analytics
  • Supervised vs. Unsupervised Learning
  • Regression & Classification Models
  • Clustering
  • Model Evaluation & Performance Metrics
  • Capstone Project Depending on the Job domain (Finance, GeoLocation, Health, etc.) (28h)/(42h)

NVIDIA Certifications ​

Big Data Processing:

  Enhancing Data Science Outcomes with Efficient Workflow

​ Data Parallelism: How to Train Deep Learning Models on Multiple GPUs

​ Model Parallelism: Building and Deploying Large Neural Networks

Fundamentals of Accelerated Data Science

Accelerated Data Engineering Pipelines ​ 

Big Data Protection & Security: ​

   Building AI-Based Cybersecurity Pipelines

​  Application of AI for Anomaly Detection

​  Application of AI for Predictive Maintenance

 (+216) 98 106 016  -(+216) 98 270 400

   training-center@horizon-tech.tn