Slide 1

Slide 1 text

Domain Data Platform for Scalable Data Management Ball (Weera Kasetsin) CPO, LINE Thailand

Slide 2

Slide 2 text

No content

Slide 3

Slide 3 text

And this…

Slide 4

Slide 4 text

Spot Any Difference?

Slide 5

Slide 5 text

คุณเคยประสบปัญหาเหล่านี้หรือไม่? > I want to ingest my application (domain) data to IU, but I have to wait for a data engineer to do it > A data engineer has to learn every schema > When the IU resource is insufficient, you cannot use any BI reports, even if it’s just the operational report

Slide 6

Slide 6 text

What-if > You (and your members) have more flexibility to develop and use BI reports or operational reports with less dependence on the central platform > No need many data engineers to handle data ingestion. > Engineers can handle data integration with less effort (More focused on creating data facility tools) > We can fully utilize IU computing resources rather than data ingestion task

Slide 7

Slide 7 text

Current State Data Integration and Data Analysis

Slide 8

Slide 8 text

No content

Slide 9

Slide 9 text

No content

Slide 10

Slide 10 text

What we believed Centralized Data Analytic Platform > Improved collaboration (maximize data utilization across the company) > Fewer resources required > Help to streamline processes > Improve security > Better data quality

Slide 11

Slide 11 text

Centralized Data Analytic Platform (CDAP) What we know about it > Monolithic designs > Centralistic operation models > Always complex due to various requirements > Insufficient resource utilization in the analysis

Slide 12

Slide 12 text

CDAP What we know about it

Slide 13

Slide 13 text

The Downside of CDAP > The complexity of raw data is that use cases always require reworking the data. > Data quality problems must be sorted out, transformations are required, and other data are enriched to bring the data into context. > When data is repeatedly copied and scattered throughout the organization, it becomes more difficult to find its origin and judge its quality.

Slide 14

Slide 14 text

The Downside of CDAP > Requires you to develop a single logical view of the same data that is managed in different locations. > Extensive data distribution makes controlling the data much more difficult because data can be spread even further

Slide 15

Slide 15 text

This requires organizations to redefine how people, processes, and technology are aligned with data

Slide 16

Slide 16 text

Data Strategy What is it? A data strategy is a long-term plan that defines the technology, processes, people, and rules required to manage an organization's information assets.

Slide 17

Slide 17 text

Data Strategy Best practice - Data strategy 2 parts: > Operational transactional processing > Analytical data warehousing and big data processing

Slide 18

Slide 18 text

Data Strategy > Focus on your business goals and strategy > Determine the correct balance between “defensive” and “offensive.” > Is full control a top priority? Or flexibility for innovation? > How does regulation impact your strategy? > These considerations will influence your initial design and the pace of federating certain responsibilities.

Slide 19

Slide 19 text

Data Strategy > Operational analytics, focuses on predicting and improving the existing operational processes > The analytical results need to be integrated back into the operational system’s core so that insights become relevant in the operational context.

Slide 20

Slide 20 text

Data Strategy > Regulations, such as the new EU laws on data governance and artificial intelligence > Force large companies to be transparent about what data is collected and purchased, what data is combined, how data is used within analytical models, and what data is distributed (sold)

Slide 21

Slide 21 text

Provide the Self-serve Data analytic platform for data users at All level – LCT Data Mission

Slide 22

Slide 22 text

Future of Data Strategy New LCT’s Data Strategy and Platform Architecture Balance the centralized and decentralized data strategy, which includes customer-focused, business functions, legal, finance, compliance, and company-wide data governance.

Slide 23

Slide 23 text

Implementation the new Data Strategy (1) Generating refined data assets within the Domain data platform > Empowering the domain to self-managed its data and authorizing Self-served domain analytics > Cost-effective utilization of On-Demand computation resources > Reduce Time to consume data

Slide 24

Slide 24 text

Implementation the new Data Strategy (2) Publishing Curated Data Assets to a Central Data Platform for Cross-Domain Analysis. > Mitigate spoiled data assets, which were prematurely ingested in a centralized data platform from domain raw data assets. > Minimizing Premature Data Governance Efforts

Slide 25

Slide 25 text

Domain-Data Concept > The (domain) context of the business problem influences the design of the application and finds its way into the data > Unique business problems require unique thinking, unique data, and optimized technology to provide the best solution

Slide 26

Slide 26 text

Data Mesh Architecture > An (exciting new) methodology for managing data at large. > The concept foresees an architecture in which data is highly distributed and a future in which scalability is achieved by federating responsibilities. > It puts an emphasis on the human factor and addresses the challenges of managing the increasing complexity of data architectures.

Slide 27

Slide 27 text

Principles for Distributed and Domain-Oriented Data Management 1. Avoid data silos 2. Only capture and modify data at the golden source 3. Respect the rules of data ownership

Slide 28

Slide 28 text

Domain Ownership Responsibilities > Taking ownership of data pipelines, such as ingesting, cleaning, and transforming data, to serve as many data customers’ needs as possible > Improving data quality and respecting service level agreements (SLAs) and quality measures set by data consumers > Encapsulating metadata or using reserved column names for fine-grained row/column-level filtering and dynamic data masking

Slide 29

Slide 29 text

Domain Ownership Responsibilities Adhering to metadata management standards, > Application and source system schema registration > Providing metadata for improved discoverability > Observing versioning rules > Linking data attributes and business terms > Ensuring the integrity of metadata information to allow better integration between domains

Slide 30

Slide 30 text

Domain Ownership Responsibilities > Adhering to data interoperability standards, including protocols, data formats, and data types > Providing lineage, either manually or by linking source systems and integration services to scanners > Completing data-sharing tasks, including identity and access management reviews and data contract creation

Slide 31

Slide 31 text

No content

Slide 32

Slide 32 text

No content

Slide 33

Slide 33 text

Future – New LCT Data Analytic Platform

Slide 34

Slide 34 text

No content

Slide 35

Slide 35 text

No content

Slide 36

Slide 36 text

No content

Slide 37

Slide 37 text

No content