Workshops and Hackathon: From Data Sourcing to Web Publishing Series 2020/21

01/02/2021 to 23/05/2021

We are pleased to offer the following workshops on the theme “From Data Sourcing to Web Publishing” from February to April and a hackathon in May 2021. All CUHK students and staff are welcome to join.

Co-organizers: Center for Entrepreneurship, CUHK Library, Data Science and Policy Studies Programme, Department of Cultural and Religious Studies, Department of Management, Department of Social Work, Entrepreneurship and Innovation Minor Programme, School of Journalism and Communication of CUHK


The workshops are classified in five categories:

  1. Problem Definition
  2. Data Preparation
  3. Analysis/Modeling/Validation
  4. Presentation/Visualization/Storytelling
  5. Deployment

Students who attend at least five workshops, with at least one from each category, will be issued a certificate of attendance (Introduction to Data Competency Certificate). A workshop that falls into multiple categories counts towards only one category for this requirement.











Visualising Network Relationships with Gephi

(Digital Scholarship & Research Data Workshops)

23 Feb 2021 (Tue)

3:00 - 4:15 pm

Click here to register






The ABC of Digital Competency: A Layman’s Guide to AI, Big Data and Cloud computing

Instructor: Prof. Bernard Suen, Center for Entrepreneurship, CUHK

AI, Big Data, and Cloud Computing have become the digital infrastructure for application development on the net. Without the cloud forming the backbone for robust operations and for outsourcing core hardware/software systems and security services, businesses and research institutes could not scale up their data acquisition, storage, and processing capacities. As the volume, velocity, variety, and veracity of datasets grow, the powerful parallel and distributed processing resources provided by cloud platforms enable faster R&D and innovation cycles. On this foundation, AI and machine learning technologies can be applied to huge datasets to build complex analytic models.
This workshop explains how the three relate to one another, building the digital literacy needed to understand the ABC of modern computing.

25 Feb 2021 (Thu)


2:30 - 5:30 pm

Click here to register






Introduction to Programming in Python

Instructor: Prof. Bernard Suen, Center for Entrepreneurship, CUHK

Python is one of the most versatile languages to learn for mastering computer and data science. Its simple and elegant syntax makes it an ideal language for beginners, and it is also used by scientists and engineers to build complex models and systems.
This hands-on workshop aims to cover the basics of the Python language:

  • a) Basic data types and computational operations 
  • b) Data collections such as lists and dictionaries for building advanced data structures
  • c) Introductory use of modules, functions, and classes for designing and implementing algorithms

Jupyter Notebook will be used in the workshop for programming in Python.
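As a rough taste of topics (a) to (c), a minimal sketch could look like the following. The names and values (`price`, `students`, `Student`) are invented for illustration and are not taken from the workshop material.

```python
# Illustrative only: every name and value here is made up.
from statistics import mean  # c) using a standard-library module

# a) Basic data types and computational operations
price = 19.5      # float
quantity = 4      # int
total = price * quantity

# b) Collections: a list of dictionaries as a simple data structure
students = [
    {"name": "Ada", "scores": [80, 92]},
    {"name": "Ben", "scores": [65, 71]},
]


# c) A function and a class built on top of those collections
def average(values):
    """Return the arithmetic mean of a non-empty list of numbers."""
    return mean(values)


class Student:
    """A small record type that knows how to summarise its own scores."""

    def __init__(self, name, scores):
        self.name = name
        self.scores = scores

    def mean_score(self):
        return average(self.scores)


roster = [Student(s["name"], s["scores"]) for s in students]
means = {s.name: s.mean_score() for s in roster}
```

Each cell of a Jupyter Notebook could hold one of the lettered parts, so the intermediate values can be inspected step by step.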

27 Feb 2021 (Sat)


9:30 am - 12:30 pm,

2:00 -  5:00 pm

Click here to register






Data Scraping and Cleaning for Non-Programmers

Invited Speaker: Prof. Liang Hai, School of Journalism and Communication, CUHK

In this session, we will introduce how to collect various data automatically from popular websites using APIs and web scraping. In addition to the basic logic of web crawling and data cleaning, we will use KNIME for a hands-on demonstration.


Participants can install KNIME on their device prior to the workshop.

5 Mar 2021 (Fri)

2:30  - 5:30 pm

Click here to register






Detecting Correlation and Relationship in Data

Invited Speaker: Dr. Ho Chi Pui, Data Science and Policy Studies Programme, Faculty of Social Science, CUHK

This workshop reviews the key techniques in detecting correlation and relationship in data. Key topics to be covered are descriptive statistics, correlation, simple and multiple regression, data visualization and outlier detection. Examples will be used to illustrate the above techniques.
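To give a flavour of three of these techniques, here is a minimal sketch in plain Python of the Pearson correlation coefficient, a simple least-squares regression line, and a z-score rule for flagging outliers. The function names, the sample data, and the threshold of 2 standard deviations are assumptions chosen for illustration, not the examples used in the workshop.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / sqrt(sxx * syy)


def fit_line(xs, ys):
    """Least-squares slope and intercept for the model y = slope * x + intercept."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    slope = sxy / sxx
    return slope, my - slope * mx


def zscore_outliers(values, threshold=2.0):
    """Values lying more than `threshold` population standard deviations from the mean."""
    n = len(values)
    m = sum(values) / n
    sd = sqrt(sum((v - m) ** 2 for v in values) / n)
    return [v for v in values if abs(v - m) / sd > threshold]


# Hypothetical sample data: hours studied versus test score.
hours = [1, 2, 3, 4, 5]
scores = [52, 55, 61, 64, 68]

r = pearson_r(hours, scores)
slope, intercept = fit_line(hours, scores)
outliers = zscore_outliers([10, 11, 9, 10, 12, 10, 30])
```

A z-score cut-off is only one common convention for outlier detection; other rules, such as those based on the interquartile range, may flag different points.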

11 Mar 2021 (Thu)


2:30 - 4:30 pm

Click here to register






Introduction to R for Statistical Analysis

(Digital Scholarship & Research Data Workshops)

Invited Speaker: Prof. Michael Chan, School of Journalism and Communication, CUHK

12 Mar 2021 (Fri)


2:30 - 4:30 pm

Click here to register






Beginning Web Publishing II: Programming in JavaScript

Instructor: Prof. Bernard Suen, Center for Entrepreneurship, CUHK

This hands-on workshop aims to cover the basics of the JavaScript language:

  • a) Using JavaScript as a front-end tool for manipulating HTML and CSS
  • b) Data collections such as arrays and objects for building advanced data structures
  • c) Use of functions and event listeners for adding interactivity to the user interfaces
  • d) Introduction to JSON and the JavaScript Fetch API for basic back-end communication and front-end data visualization in Leaflet/OpenStreetMap and C3/D3.

Knowledge of HTML/CSS and the concepts of the DOM (Document Object Model) and the box model are prerequisites for signing up for the workshop. Attendees are also expected to know how to use a code editor such as Visual Studio Code, Atom, Sublime Text, or Notepad++.

13 Mar 2021 (Sat)


9:30 am – 12:30 pm,

2:00 - 5:00 pm

Click here to register






Introduction to ArcGIS and QGIS for Digital Scholarship Projects

(Digital Scholarship & Research Data Workshops)

16 Mar 2021 (Tue)


3:00 - 4:30 pm


Click here to register






Monitoring living environment using Micro:bit

Invited Speaker: Mr. Clarence Fong, Meteorologist, United Nations ESCAP/WMO Typhoon Committee Secretariat  

Temperature and relative humidity vary every day. A good combination of both is essential for comfortable living. This workshop introduces the monitoring of temperature and relative humidity using Micro:bit, how to interpret the readings, and how to optimize the environment. Simple JavaScript programming is required. The readings will be published to an IoT (Internet of Things) platform for user browsing and data sharing.

19 Mar 2021 (Fri)


2:30 - 5:30 pm

Click here to register

(Registration will open later)






Web and Database Programming in Python

Instructor: Prof. Bernard Suen, Center for Entrepreneurship, CUHK

This workshop aims to introduce Python Flask, a popular web application framework for connecting to SQL (e.g. MySQL) and NoSQL (e.g. MongoDB) databases on the back-end and serving RESTful API requests originating from the front-end (e.g. web browser or mobile), following an MVC (model-view-controller) framework.
This hands-on workshop aims to cover the following:

  • a) Set up of the Flask framework enabled with the Jinja2 template system
  • b) Set up of MySQL database for basic CRUD (create, retrieve, update, and delete) support
  • c) Introduction to authentication and authorization using Basic Auth and JWT (JSON Web Tokens)
  • d) Setup of RESTful API endpoints and JSONification of MySQL data for data exchange and operations
  • e) Discussions and demonstrations of advanced algorithmic model integration for data scraping, IoT connections, exploratory data analysis, machine learning, and data visualization.

*Knowledge of Python, HTML/CSS/JavaScript and SQL databases is required for signing up for the workshop.
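For readers unfamiliar with topic (c), the sketch below shows the general shape of a JWT (JSON Web Token) signed with HMAC-SHA256, using only the Python standard library. It is an illustration under stated assumptions, not the code used in the workshop: the helper names (`b64url`, `sign_token`, `verify_token`), the payload and the secret are all hypothetical, and it checks the signature only, not expiry or other claims. A real application would normally rely on a maintained JWT library instead.

```python
import base64
import hashlib
import hmac
import json


def b64url(data: bytes) -> str:
    """Base64url-encode without padding, as used for JWT segments."""
    return base64.urlsafe_b64encode(data).rstrip(b"=").decode("ascii")


def sign_token(payload: dict, secret: bytes) -> str:
    """Build header.payload.signature with an HS256 signature."""
    header = {"alg": "HS256", "typ": "JWT"}
    signing_input = ".".join(
        b64url(json.dumps(part, separators=(",", ":")).encode("utf-8"))
        for part in (header, payload)
    )
    signature = hmac.new(secret, signing_input.encode("utf-8"), hashlib.sha256).digest()
    return signing_input + "." + b64url(signature)


def verify_token(token: str, secret: bytes) -> bool:
    """Recompute the signature and compare it in constant time."""
    if token.count(".") != 2:
        return False
    signing_input, signature = token.rsplit(".", 1)
    expected = hmac.new(secret, signing_input.encode("utf-8"), hashlib.sha256).digest()
    return hmac.compare_digest(b64url(expected).encode("utf-8"), signature.encode("utf-8"))


# Hypothetical usage: issue a token for a user, then check it on a later request.
token = sign_token({"sub": "alice", "role": "student"}, b"demo-secret")
is_valid = verify_token(token, b"demo-secret")
```

Because the signature covers both the header and the payload, changing either segment or using a different secret makes verification fail.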

10 Apr 2021 (Sat)

9:30 am - 12:30 pm,

2:00 - 5:00 pm

Click here to register 






Basics of Data Mining with Weka (Hands-on)

(Digital Scholarship & Research Data Workshops)

15 Apr 2021 (Thu)

11:00 am - 12:30 pm

Click here to register






Venue: Online

Medium of Instruction: Cantonese (English may be used if participants require it.)


* Pre-requisite  

More workshop information will be provided later.



Main Theme: Post-COVID-19 Opportunities: What People Want to Do After the Pandemic

Tentative Date: 21 (Fri) – 23 (Sun) May 2021


  • Promoting innovation skills, including data and digital competency, to CUHK students and staff
  • Enhancing multi-disciplinary and multi-dimensional collaboration among CUHK students and staff


  • Within 48 hours, participants will apply what they have learnt about data mining in the pre-hackathon workshops to tackle a current problem of the Post-COVID-19 era. Participants shall select one of four sub-themes and work as one group to provide an innovative solution. Each group will present its solution on Day 3.
  • Awards will be given to the winning presentations.


Eligibility: CUHK students, 3 to 5 persons per group from 2 different Faculty groups

Group 1: Faculty of Arts, Faculty of Business Administration, Faculty of Education, Faculty of Law, Faculty of Social Science

Group 2: Faculty of Science, Faculty of Engineering, Faculty of Medicine


*More details about the Hackathon will be provided later.