(此課程內容只提供英文版)
How to become data scientist (FREE SEMINAR)
Date: TBC
Time: TBC
Venue: VTC Tower, Wan Chai
Target Audience: Those who are interested in big data analytics, IoT and the career of a Data Scientist
Enquiry and Enrolment:
(此课程内容只提供英文版)
Introduction
The course aims to provide students with knowledge in Big Data platforms, data modelling, virtualization and analytics. Participants are expected to get familiar with different big data platforms, programming methodologies and virtualization tools for analytics. The programme provides an insight on how business world is using Big Data to improve their business models.
Objectives
- Candidates can setup Big Hadoop Data Platform and start Big Data process into HBase and extract HBase data for other integration and business cases.
- Candidate can setup Hive for advanced level data process engine for other integrated module.
- Candidate can apply streaming engine to stream social media contents into Hive and moving data between Hadoop, HBase to Hive
- Candidate will learn how to create Java Program to access HBase Table directly including Table operation, data processing
- Candidate will learn how to create Java Program to access Hive JDBC for data processing
- Candidate will learn basic R programming and some basic modelling such as Decision Tree, Linear Regression, Non-linear Regression, Neural Network with Demo Data from R and some stock market data from R virtualization tool.
- Candidate will learn how to use Qlikview for data presentation (Personal Edition)
- Create Dashboard from QlikView with basic analytic features and business cases
- Candidate will understand, what is Advance Data Analytics Tools (Raid Miner, Alteryx and business cases)
- Candidate will learn some basic troubleshooting skill during the exercise.
(此课程内容只提供英文版)
Course Contents
- Hadoop Big Data Platform
- Hadoop Big Data Framework & Platform
- Hbase Introduction, Setup and Practice
- Hive Introduction, Setup and Practice
- Flume Introduction, Setup and Practice
- R Programming
- External Integration with R Studio
- Data Modelling (Decision Tree, Linear Regression, Non-linear Regression, Neural Network)
- Data Analytics
- Introduce Data Virtualization Tool for Analytics (QlikView)
- Introduce Data Virtualization Tool for Analytics (RaidMiner, Alteryx)
Speaker Profile
Being a certified PMP, ITIL, SAS, Java, SAP FI/CO and PP consultant, Mr. Joe Chan has 25 years’ experience in the IT industry with more than 15 years in management role for implementation of SAP ERP, CRM and Business Intelligent solution for HK MNC and factory plants in China and other countries across various industries. Mr. Chan has been the PM of a billion TW dollars SAP project in Taiwan. One of the major tasks was to ensure the SAP solutions comply with all regulatory statutes and adhere to the best practices. Since 2013, he has been focusing on implementing Big Data framework and analytics as well as machine learning solution for different industries.