form that a computer can process. - Data can: - Be personal - Be transactional - Come from the web - Come from sensors i.e. IoT WHAT’S THE DEAL ABOUT DATA? 4
light. With that comes an increase in data processing speeds and the addition of data creators and consumers. - Data is the key to smooth operation of companies. Without data, progress would halt and we can’t make decisions. - The modern data ecosystem is expanding! Traditional tools and methods of analysis may no longer cut it. - Need new tools and techniques, which paves the way to new knowledge and insight. - But data can’t do much without people. WHAT’S THE DEAL ABOUT DATA? 6
for business operations and analysis - Extract, integrate and organise data - Clean, transform and prepare data - Store data and manage data repositories to be used. - Tools - SQL - Python or R - Databases - Data integration platforms like Mulesoft DATA ENGINEER 8
or deep learning models - Train past data to build predictive models. - Pick up on problems that will have most value once addressed. - Analyse data to answer questions. “How many Twitter followers am I likely to get next month?” - Tools and Knowledge - SQL - Python or R - Excel - BI Tools, e.g. Tableau and Power BI - Mathematics and statistics - Databases - Building data models DATA SCIENTIST 9
- Inspect data for deriving insights, fi nding correlations and patterns. - Present and visualise data to tell a story. - Profession speci fi c data analysts include marketing analysts, fi nancial analyst, business analyst. - Tools and Knowledge - SQL - Excel - Python or R - Business Intelligence tools e.g. Tableau and Power BI DATA ANALYST 10
and data scientists. - Provide business intelligent solutions by monitoring data on di ff erent business functions. - Use data to extract insights to improve business processes and performance. - Tools and Knowledge - SQL - Excel - Business Intelligence tools e.g. Tableau and Power BI BUSINESS ANALYST 11
the rise of the data scientist. - Machine learning (automating model building) and deep learning (training a computer to perform human-like tasks) is in their toolbox. DATA SCIENCE SPOTLIGHT ON: 12 Source: the Complete Data Science Landscape by Zohaib Ahmed https://www.kaggle.com/general/210722
performing complex calculations at high speed. - Australia has 4 supercomputers for research purposes. - Magnus (Pawsey - Perth) - FlashLite (UQ) - MASSIVE (Monash) - GPU - Gadi (NCI - ANU) - Mechanisms of protein binding and receptor inhibition in COVID-19. - Bush fi re predictive modelling. HIGH PERFORMANCE COMPUTING FOR RESEARCH SPOTLIGHT ON: 13