Expressvpn Glossary
Canonical data model (CDM)
What is a canonical data model?
A canonical data model (CDM) is a shared reference format that apps and services use to exchange data. It defines common fields, such as identifiers, logs, event types, and application data. This helps systems use and interpret data consistently, supporting clear and efficient communication.
How does a canonical data model work?
A CDM uses a shared schema as an intermediate model for exchanging data between systems. This is often used in a hub-and-spoke integration approach, where each system maps to the CDM rather than creating separate mappings for every other system. The process unfolds in these steps:
- Apps or services exchange data through an integration layer that uses the CDM as the shared schema. The data may be serialized as JavaScript Object Notation (JSON), Extensible Markup Language (XML), or another format.
- When sending requests or information, each app maps its native data into the CDM-defined format.
- Integration tools, such as enterprise service buses (ESBs), middleware, integration platforms, or API gateways, route data between communicating apps and may help apply mappings or transformations.
- When the information reaches another app or platform, that app converts the standardized data back into its native format.

Why is a canonical data model important?
A properly implemented CDM can simplify scalability and governance while supporting more consistent, reliable data for analytics.
Having a shared data framework can speed up enterprise application integration (EAI). Each new service maps to the shared model rather than requiring separate point-to-point mappings with every other service, thereby improving efficiency. This makes it easier for organizations to onboard new applications and scale operations.
Canonical models may also streamline maintenance. If one application changes its data structure and the shared model remains unchanged, only that app’s mapping needs to be updated. This isolates changes between connected services and can make updates easier to manage.
Standardization can also support governance and compliance by establishing a consistent reference for how shared data is defined. Fields such as customer IDs, event types, product codes, timestamps, and order dates follow the same structure, naming conventions, and format across applications. This helps prevent mismatches and other errors.
Where is it used?
Common applications of CDM include:
- APIs, messages, and service-oriented architecture (SOA): Standardizing requests, responses, or messages between shared services.
- Extract, transform, and load (ETL) processes: Transforming source data into a common structure before loading it into data warehouses, data lakes, or analytics systems.
- Enterprise data platforms: Unifying customer, operational, and transactional data from multiple systems into a consistent structure.
- Identity provisioning and access systems: Aligning user IDs, groups, roles, and attributes across identity providers and applications.
- Security monitoring and analytics: Normalizing logs and event data from firewalls, security information and event management (SIEM) tools, and endpoint security tools.
- Cloud and hybrid environments: Integrating data across on-premise systems and cloud services.
Risks and challenges
CDMs can require significant effort to set up because teams must agree on shared data definitions and mappings. Some app data may not fit established schemas cleanly. Additional transformations may be needed, which increases the risk of introducing inconsistencies.
Updating the model can also become challenging over time, since changes may affect multiple connected apps and risk breaking existing mappings. As more systems depend on the shared model, even small changes can take longer and increase coordination effort.
As a result, organizations may stagger the adoption of CDMs. Older system-to-system integrations may also limit compatibility and often continue alongside the CDM.
Further reading
- What is a data warehouse? A complete guide
- What is big data security and privacy?
- Security concerns in cloud computing and how to address them
- What is access control? How it works and why it matters