Big Data - Start with data ware house

Big Data and Data Warehouse
Evolution of big data from data warehouse

Conception

  • What is big data
    • Definition of big data
    • Data processing
  • What is Data warehouse
    • Inmon or Kimball
    • Dimensional modeling
    • Star Schema vs. Snowflake Schema
    • ETL extract, transform and load
  • DW new era
    • Hadoop
    • Kafka
    • Spark
    • Oozie
    • Steam processing
  • Hands on
    • MongoDB
    • Impyla for Spark
    • Numpy Pandas Matplotlib

Dimension modeling

  • How to ETL
  • Non-structure data
  • Best practise

Preparation

  • Basic concept and experience on Python
  • Install anaconda with numpy pandas matplotlib scipy
  • MongoDB
  • SQL knowledges