

BIG DATA ANALYTICS

ABOUT
Big data
Big data is the fast-growing field of analytics, collecting and sorting through huge amounts of information to turn it into real insights that can make a difference.
On a broad scale, data analytics technologies and techniques give organizations a way to analyze data sets and gather new information.
Our Big Data training programme will equip participants with the necessary skills to successfully manage this valuable resource.
Our programme enables you to confidently harness the power of Big Data in a way that maximizes your return on investment and minimizes risks.
Want to know how we can help you? Enroll now on our Big Data training programme!
DATA SCIENTIST

Data Science Foundation
-
Module 1: Introduction to Big Data & Data Science
-
Module 2: Fundamental of Data Science
-
Module 3: Exploratory Data Analysis: Getting and cleaning data, data transformation, descriptive analytics, data visualtisation
-
Module 4: Data Management: Analytics from structured and un-structured data
Duration: 2 + 2 days


Data Mining
-
Module 1: Association rule mining and outlier analysis
-
Module 2: Classification
-
Module 3: Regression
-
​Module 4: Cluster analysis
Duration: 2 days

Social Media Analytics
-
Module 1: Mining Social Media Content
-
Module 2: Text Mining
-
Module 3: Sentiment Analysis
Duration: 2 days



Developing Data Product
-
Module 1: Developing data products using R packages
-
Module 2: Shiny and interactive graphics
Duration: 2 days
DATA ENGINEERING

Hadoop Essentials
-
Introduction to Big Data & Data Engineering
-
Hadoop Essential - Hadoop Architecture, Hadoop Ecosystem, HDFS, Hbase, MapReduce
-
​Data Management - SQL, NoSQL
Duration: 2 + 2 days



ELT Hadoop Data
-
Data Ingestion for Hadoop - Sqoop, Flume, Storm
-
Interaction with Hadoop Data
-
​Selection of the appropriate tool data extraction, transformation and loading
Duration: 2 days

Hadoop Administration
-
Module 1: Installing, configuring and managing a Hadoop cluster
-
Module 2: Monitoring a Hadoop cluster
-
Module 3: Administering a Hadoop cluster
Duration: 2 days



High Performance Processing
-
Module 1: Installation and configuration of Apache Spark for data analysis
-
Module 2: Introduction to PySpark
-
Module 3: Applying machine learning on various problems using the Spark machine learning library
Duration: 2 days
DATA MANAGEMENT

Fundamentals of Big Data
-
What is Big Data?
-
Sources of Big Data
-
Making sense of data
Duration: 2 hours



The Big Data Analysis Lifecycle
-
Business Case Evaluation
-
Data Identification
-
Data Acquisitions and Filtering
Duration: 2 hours

Leverage the Understanding of Big Data
-
Data Extraction
-
Data Validation and Cleansing
-
Data Aggregation and Representation
-
Data Analysis
-
Data Visualtisation
-
Analysis Result
Duration: 2 hours



Big Data Governance
Duration: 2 hours