Logo

Governance

Data Governance & IT Transformation: Keys to an Integrated Approach

Data Governance & IT Transformation: Keys to an Integrated Approach

Eric Draperi

Data Governance & IT Transformation: a Guide to an Integrated Approach
Data Governance & IT Transformation: a Guide to an Integrated Approach

Data governance has become a boardroom topic. CIOs are dedicating budgets, teams, and steering committees to it. Frameworks exist. Policies too. And yet, even in the most structured organizations, the same frustration keeps surfacing: the data is there, but nobody is reading the same reality.

This isn't a maturity problem - it's an integration problem. When data governance lives in its own tools, disconnected from Architecture, project portfolios, and delivery, it isn't really governing anything: it's just documenting.

Meanwhile, IT transformation keeps moving. AI agents are entering the information system without waiting for the frameworks to be ready. And decisions get made, with or without the full picture.

This article shows you how to turn data governance into an integrated steering lever, not just another reference layer running parallel to your transformation.

In a nutshell

Organizations today have established data governance frameworks. However, the rapid advancement of agent-based AI is exposing a structural limitation: governance that is disconnected from Architecture and delivery is no longer sufficient to control what is actually being deployed within the organization. This article explores how to take your data governance strategy to the next level by truly integrating it into your transformation efforts.

What is data governance, and where does it actually stop?

Defining data governance and its real scope

Data governance refers to the full set of policies, roles, and processes that frame how an organization manages its data throughout its entire lifecycle - from collection to deletion, through storage, processing, and sharing.

Its core purpose: ensuring that enterprise data is reliable, accessible to the right people, compliant with regulatory requirements, and usable in decision-making.

That scope is broader than it looks. Data governance isn't just about database quality or GDPR compliance. It covers data traceability across the information system, access rights definition, metadata management, domain-level ownership… and, increasingly, oversight of automated usage by AI models and agents. In that sense, it touches both data security and data-driven value creation for the business.

What makes it complex in large organizations is the sheer scale of what needs to be governed: an information system spanning hundreds of applications, data flowing through dozens of business processes, teams working with their own tools and reference systems. All of which confirms that data governance is, structurally, a cross-functional consistency problem. And cross-functional consistency can't be mandated into existence.

“What we see with our clients is that the problem stems from a lack of connection between data governance and the operational reality of the IT system. One documents. The other moves forward. And the two don’t communicate enough.” - Eric Draperi, Smoteo cofounder

PPM, BI, Agile, Delivery : Which Tool(s) Are Right for Your Governance?

Download the Guide

Beyond GDPR: the real stakes for CIOs and large organizations

GDPR had the merit of forcing organizations to ask questions they had been avoiding. Who owns this data? Where is it stored? Who has access? These questions shaped a first generation of governance programs - often compliance-driven, sometimes at the expense of value creation.

The regulatory landscape has since grown considerably denser:

  • The AI Act, a cornerstone of Europe's artificial intelligence strategy, now requires organizations to guarantee the traceability, reliability, and bias-free nature of data feeding AI technologies.

  • The Data Act redefines access and sharing rights for data generated by connected objects and services.

  • DORA strengthens informational resilience requirements in the financial sector.

Compliance is no longer a one-time project: it's a permanent operating condition.

But for CIOs and transformation leaders, the stakes go well beyond regulatory compliance. The real questions are strategic: does the available data enable fast decisions, precise project scoping, and anticipation of how a transformation will affect the IS?

Effective data governance reduces the time it takes to frame IT projects, makes budget arbitrations more reliable, and gives architectural impact analyses the credibility they need in steering committees. Poorly integrated data governance, on the other hand, produces dashboards nobody looks at, and leaves decisions resting on shaky foundations.

What does effective data governance actually look like in practice?

Quality, integrity, traceability: the three non-negotiable foundations

Data governance rests on three technical pillars that are, in practice, inseparable.

Data quality refers to the accuracy, completeness, and consistency of data across the systems that consume it. Quality data is data a decision can rely on without needing to verify it first. In a complex information system, ensuring data quality requires domain-level validation rules, automated controls, and clearly assigned ownership (a data owner) for each scope.

Data integrity goes further: it guarantees that data hasn't been altered (intentionally or not) through its various transformations. When a piece of data moves from a delivery tool to a reporting tool, then to an executive dashboard, each transformation is a potential point of corruption. Integrity means that chain is documented, controlled, and auditable.

Data traceability (or data lineage) is the ability to reconstruct a data point's journey from its source to its point of use. Regulations now require it (AI Act, GDPR), but its operational value is just as significant: when a figure gets challenged in a steering committee, lineage lets you resolve the dispute in seconds rather than three days of consolidation work.

These three principles don't work in isolation. Traceable data that's poor quality is still unusable. Quality data without documented lineage can't be defended from a regulatory standpoint. It's the combination of all three that forms the foundation of an operational governance policy.

The role of the Chief Data Officer (and the limits of what that role can carry alone)

The CDO embodies information governance at the leadership level. They define the data strategy, drive quality and compliance programs, arbitrate priorities across data domains, and champion a data culture across the organization. In large organizations, their legitimacy has grown significantly with the pressure of regulation and the rise of AI.

But the CDO can't carry everything. Their effectiveness depends directly on the quality of the structures feeding them information (data stewards, data owners, business teams…) and on the organization's ability to align its practices around a shared framework. Without collective adoption, the CDO has a strategy with nothing to grip. They govern what they can see. And in a fragmented IS, what they see is rarely the full picture.

That's precisely where the role hits its structural ceiling. Data governance carried by a single function, however well-resourced, remains partial governance. It doesn't naturally connect to architecture decisions, portfolio arbitrations, or delivery choices. To become a real management lever, it has to work in concert with the other dimensions of the IS. And that's where enterprise architecture comes in.

"What I see in many organizations: the CDO has a strategy. Teams have their tools. And in between, there's a gap. Nobody governs the dependencies. Nobody sees the IS as it actually is. That's a colossal waste of talent." - Eric Draperi, Smoteo cofounder

Enterprise Architecture as the backbone of data governance

The Enterprise Architect has a vantage point few others in the IT organization share: interdependencies. They see how applications connect, how data flows between systems, how a local technical choice creates an invisible dependency downstream. That position makes them a natural actor in data governance… as long as their work doesn't stay buried in a SharePoint.

The recurring problem is well known: architecture maps get produced, documented, and then go stale before anyone actually uses them. The IS evolves faster than the reference systems meant to describe it. The Enterprise Architecture Lead ends up reconstructing the current state rather than anticipating future trajectories. And their impact analyses, critical as they are to major decisions, don't make it into the room at the right moment.

Effective data governance changes that equation. It gives enterprise architecture the means to work from a living picture of the Information System, not a static snapshot. Data flows are mapped in connection with business processes and ongoing initiatives. Dependencies are visible before projects create them. And architecture decisions are grounded in a shared reality, not manual extractions whose accuracy will be disputed in the next meeting.

Data governance that doesn't talk to architecture only governs part of what flows through the IS. Architecture that doesn't draw from data governance is working blind on its own reference systems. The two disciplines need each other. And their integration is precisely what most large organizations are missing.

Why AI agents are rewriting the rules of data governance

When agents consume and act on data without human supervision

For years, data governance was designed to manage human usage. Users access data, interpret it, make decisions. Quality rules, access rights, validation procedures — the entire framework assumed a human in the loop, capable of spotting an anomaly, questioning a figure, or refusing an inconsistent action.

AI agents fundamentally change that equation. An agent doesn't read data: it acts on it. It can query a CRM, cross-reference financial data, trigger a purchase order in an ERP, or modify an ITSM workflow, autonomously, at a speed and scale no human operator can supervise in real time.

And unlike a user who hesitates when something looks off, an agent amplifies inconsistencies at scale. Poor-quality data no longer produces a one-off error. It propagates through every automated decision that depends on it.

The problem isn't the agent itself. It's the speed at which it embeds itself in the IS before any framework is in place. Business teams run experiments. Leadership launches proofs of concept. Automations go into production without IT being in the loop. The CIO ends up in an uncomfortable position:

  • Govern too tightly, and you slow down a transformation the executive committee is expecting.

  • Govern too loosely, and you discover (after the fact) agents running in production on sensitive data, with unchecked permissions and no audit trail of the actions they've taken.

"We see it in organizations everywhere: traditional governance is structurally too slow. Agents get deployed by business teams on customer or financial data, without IT in the loop. Not out of bad intent. Out of speed." - Eric Draperi, Smoteo cofounder

The new requirements: lineage, action rights, traceability, sovereignty

Agentic AI forces a rethink of four dimensions of data governance that already existed, but were never designed for this level of autonomy.

Data lineage (where data comes from, how it's been transformed, in what context it's being used) becomes an operational requirement as much as a regulatory one. An agent making a decision must be able to explain which data it relied on, where that data originated, and whether it was appropriate for the use case. Without documented lineage, there's no way to audit an automated decision, and no way to detect a drift before it produces real consequences.

Action rights are the new frontier of governance. Defining who has access to what data is no longer enough. You need to define what an agent is allowed to do with that data (read, modify, trigger, delete), and under what conditions human validation is required. This is a granular, operation-level control logic that traditional governance frameworks never anticipated.

The traceability of each agent's actions (access, modification, trigger, timestamp, policy context) must be logged in a tamper-proof way. The AI Act requires this for high-risk systems. But beyond compliance, it's what enables a CIO to answer, in a steering committee, the question: what exactly are the agents deployed in our IS doing, and what data are they acting on?

Finally, data sovereignty takes on a new dimension. When an agent relies on a model hosted outside the European Union and processes customer or financial data, questions of data residency, exposure, and protection become matters of architecture as much as compliance.

Privacy and digital sovereignty are no longer peripheral concerns — they sit at the heart of any governance plan for organizations deploying agents at scale. Those that haven't anticipated this dimension are discovering contractual commitments made by their business teams with no prior risk analysis.

How Smoteo structures AI agent governance

As agents proliferate across the IS, Smoteo offers a supervision framework built around three complementary capabilities.

  • The AI Value Stream Map visually maps every agent deployed across the business and IT ecosystem: its function, the data it consumes, the processes it touches, the systems it interacts with.

  • The usage registry continuously tracks active AI initiatives by department — including those launched without formal approval — giving the CIO genuine visibility into what's actually running in the IS.

  • Smoteo's integrated AI agent governance frames each initiative through dedicated Epic Cards: objectives, dependencies, action rights, risks, and expected business value.

With Smoteo, agents move forward within a defined framework - not outside of it.

How to build data governance that's genuinely integrated with IT transformation

Anchoring governance in the living IS

The first mistake in most data management programs is architectural: they build reference systems alongside the IS rather than inside it. A manually maintained data catalog, a data flow map updated quarterly, a business glossary nobody consults before making a decision - these deliverables have documentary value, not operational value.

The problem isn't the rigor of the people producing them. It's that the IS evolves faster than the update cycles. A project launches, creates a new dependency between two systems, pulls sensitive data into an unmapped flow, and governance finds out three months later, during the next audit. The information exists. It's in Jira, in an email, in a Slack message. But it's never available at the moment a decision is being made.

Integrated data governance flips that logic:

  • It connects to existing delivery tools to continuously feed its reference systems — with no additional burden on teams.

  • Project statuses update automatically.

  • New application dependencies are detected at the scoping stage.

  • Emerging data flows are visible before they go into production.

This is no longer governance that documents what happened: it's governance that sees what's happening, and enables action before problems become incidents.

For the CIO Office, this shift is structural. It transforms how governance committees are prepared: instead of three days of manual consolidation to produce a partial summary, data is available in real time, from a single source everyone agrees on. With Smoteo, the resulting reduction in IT project scoping time can reach 30 to 50% — not because processes have changed, but because information no longer needs to be reconstructed from scratch each time.

Turning regulatory constraint into a steering framework

Regulatory compliance is typically treated as an external obligation. A set of requirements imposed by legislation, to be met with as little friction as possible. This posture produces one-off compliance programs, usually owned by the DPO or legal teams, disconnected from the actual IT transformation.

The current regulatory environment makes that approach unsustainable:

  • GDPR requires permanent data protection, particularly around personal data confidentiality.

  • The AI Act imposes documentation obligations on data feeding high-risk systems (origin, quality, absence of bias) along with audit logs of their decisions.

  • The Data Act redefines portability and sharing rights across scopes organizations hadn't anticipated.

These three frameworks don't apply sequentially - they coexist, overlap, and frequently concern the same data.

Organizations that have turned this constraint into a lever have adopted a different logic: rather than treating compliance as an after-the-fact check, they embed it as a native dimension of their data governance. Regulatory requirements become quality rules, scoping criteria, guardrails built into the delivery process. Compliance is no longer an audit: it's a permanent state, maintained by governance itself. This shift significantly reduces the risk of pre-production non-compliance, without adding verification overhead to operational teams.

It also changes the conversation with the executive committee. A CIO who can demonstrate that their AI initiatives are traceable, auditable, and compliant from the start isn't defending a budget. They're demonstrating control and trustworthiness. And in an environment where AI incidents are now reputational risks as much as compliance risks, that posture carries real strategic value.

What tools and practices does large-scale operational data governance require?

Data catalogs, data stewardship, data contracts

Three structural practices set apart organizations that have moved their data governance from framework to operation.

A data catalog is the centralized inventory of an organization's data assets: what exists, where it's stored, who owns it, how it's used, what quality rules apply. Built well, it's the common surface on which data, IT, and business teams can align. Built poorly (meaning: maintained manually, fed through periodic collection campaigns) it's outdated before it's useful. The key isn't the tool: it's the feeding model. A living catalog connects to source systems and updates continuously. A documentary catalog ages at the speed of the IS.

Data stewardship is the human organization that gives governance its substance. Data stewards (whether from the business side, IT, or both working in tandem as the most mature practices suggest) are the guardians of data quality and consistency within their domain. They identify anomalies, resolve definition conflicts, validate usage. Without them, governance remains a set of rules with no one to carry them. With them, it becomes a collective practice. The 2026 trend is clear: the most advanced organizations are no longer choosing between centralized and decentralized models. They operate in hybrid mode, with a central team setting the standards and business stewards embodying them in their own domains.

"A data steward without a living reference system is someone who spends their day chasing a reality that changes faster than their map. You're asking them to govern, without giving them the means to see what they're supposed to be governing." - Eric Draperi, Smoteo cofounder

Data contracts represent the next level of maturity. A data contract is a formal agreement between a data producer and its consumers: it defines the schema, quality standards, freshness SLAs, and conditions for change. In an IS where data flows across dozens of systems (and now between AI agents) this formalization becomes an operational necessity. It allows quality anomalies to be caught at the source rather than at the end of the chain, when the damage is already done.

From fixed framework to living model: data governance as an adaptive system

The structural limitation of most data governance programs isn't technical ; it's conceptual. These programs were designed as projects with an end date, producing a deliverable (a reference system, a framework, a policy) meant to hold over time. But data doesn't hold over time. Neither does the IS. And transformation, by definition, doesn't stop.

Large-scale operational data governance isn't a state to reach. It's a system that continuously adapts to the reality of the IS - to new projects, new dependencies, new AI use cases, new regulatory requirements.

That system needs three properties that traditional approaches don't guarantee:

  • Connection to reality: its reference systems feed from delivery platforms, not from manual data entry campaigns.

  • Cross-functional readability: its information is accessible and understandable to architects, PMOs, business stakeholders, and leadership - not just data teams.

  • Decision-making capacity: it doesn't just document what exists, it enables impact anticipation, initiative scoping, and objective arbitration.

This is precisely the difference between data governance that produces reporting and data governance that produces control. The first answers the question: what exists? The second answers: what's happening, what does it mean, and what needs to be decided right now?

"Data governance that produces reporting tells you what was true three days ago. Governance that produces control tells you what's true now, and what needs to be decided. They're not the same thing." - Eric Draperi, Smoteo cofounder

What's it like with Smoteo?

Smoteo makes this living governance logic concrete through a meta-model: a configurable governance model that connects strategy, architecture, portfolio, and delivery in a single surface.

Unlike a static reference system, Smoteo's meta-model feeds continuously from existing tools (Jira, CMDB platforms like ServiceNow…) without asking teams to change how they work. Every change in the IS is reflected in the model. Every new initiative is scoped in relation to existing dependencies and strategic objectives.

Smoteo's integrated portfolio orchestration then enables the full set of IT and AI initiatives to be managed as a coherent portfolio, not a collection of parallel projects. Arbitrations are grounded in a shared, reliable picture of the IS. The results? Mapped coverage of over 80% of applications, processes, and agents; 25 additional strategic decisions supported per quarter; and a 40% reduction in response time when unexpected events occur.

This is no longer data governance that documents the IS. It's data governance that drives it.

Data governance: what now?

Data governance has reached a tipping point. Organizations that invested in structured programs built solid foundations - but ones designed for a stable IS, consumed by humans, in a predictable regulatory environment. That context no longer exists.

Agentic AI has exposed the limits of governance processes built as documentary frameworks rather than management systems. The answer isn't to govern more, it's to govern differently.

Well-governed data doesn't just need to be reliable. It needs to make the organization capable of deciding fast, transforming without losing coherence, and deploying AI without losing control.

Are you leading IT transformation in a complex organization and looking to connect your data governance to your operational reality? Request your personalized Smoteo demo, and take control of your data - centralized, optimized, and always in sync.

Comparative Guide

PPM, BI, Agile, Delivery : Which Tool(s) Are Right for Your Governance?

Compare the options available to you to accelerate value creation in your organization starting tomorrow.

Data governance has become a boardroom topic. CIOs are dedicating budgets, teams, and steering committees to it. Frameworks exist. Policies too. And yet, even in the most structured organizations, the same frustration keeps surfacing: the data is there, but nobody is reading the same reality.

This isn't a maturity problem - it's an integration problem. When data governance lives in its own tools, disconnected from Architecture, project portfolios, and delivery, it isn't really governing anything: it's just documenting.

Meanwhile, IT transformation keeps moving. AI agents are entering the information system without waiting for the frameworks to be ready. And decisions get made, with or without the full picture.

This article shows you how to turn data governance into an integrated steering lever, not just another reference layer running parallel to your transformation.

In a nutshell

Organizations today have established data governance frameworks. However, the rapid advancement of agent-based AI is exposing a structural limitation: governance that is disconnected from Architecture and delivery is no longer sufficient to control what is actually being deployed within the organization. This article explores how to take your data governance strategy to the next level by truly integrating it into your transformation efforts.

What is data governance, and where does it actually stop?

Defining data governance and its real scope

Data governance refers to the full set of policies, roles, and processes that frame how an organization manages its data throughout its entire lifecycle - from collection to deletion, through storage, processing, and sharing.

Its core purpose: ensuring that enterprise data is reliable, accessible to the right people, compliant with regulatory requirements, and usable in decision-making.

That scope is broader than it looks. Data governance isn't just about database quality or GDPR compliance. It covers data traceability across the information system, access rights definition, metadata management, domain-level ownership… and, increasingly, oversight of automated usage by AI models and agents. In that sense, it touches both data security and data-driven value creation for the business.

What makes it complex in large organizations is the sheer scale of what needs to be governed: an information system spanning hundreds of applications, data flowing through dozens of business processes, teams working with their own tools and reference systems. All of which confirms that data governance is, structurally, a cross-functional consistency problem. And cross-functional consistency can't be mandated into existence.

“What we see with our clients is that the problem stems from a lack of connection between data governance and the operational reality of the IT system. One documents. The other moves forward. And the two don’t communicate enough.” - Eric Draperi, Smoteo cofounder

PPM, BI, Agile, Delivery : Which Tool(s) Are Right for Your Governance?

Download the Guide

Beyond GDPR: the real stakes for CIOs and large organizations

GDPR had the merit of forcing organizations to ask questions they had been avoiding. Who owns this data? Where is it stored? Who has access? These questions shaped a first generation of governance programs - often compliance-driven, sometimes at the expense of value creation.

The regulatory landscape has since grown considerably denser:

  • The AI Act, a cornerstone of Europe's artificial intelligence strategy, now requires organizations to guarantee the traceability, reliability, and bias-free nature of data feeding AI technologies.

  • The Data Act redefines access and sharing rights for data generated by connected objects and services.

  • DORA strengthens informational resilience requirements in the financial sector.

Compliance is no longer a one-time project: it's a permanent operating condition.

But for CIOs and transformation leaders, the stakes go well beyond regulatory compliance. The real questions are strategic: does the available data enable fast decisions, precise project scoping, and anticipation of how a transformation will affect the IS?

Effective data governance reduces the time it takes to frame IT projects, makes budget arbitrations more reliable, and gives architectural impact analyses the credibility they need in steering committees. Poorly integrated data governance, on the other hand, produces dashboards nobody looks at, and leaves decisions resting on shaky foundations.

What does effective data governance actually look like in practice?

Quality, integrity, traceability: the three non-negotiable foundations

Data governance rests on three technical pillars that are, in practice, inseparable.

Data quality refers to the accuracy, completeness, and consistency of data across the systems that consume it. Quality data is data a decision can rely on without needing to verify it first. In a complex information system, ensuring data quality requires domain-level validation rules, automated controls, and clearly assigned ownership (a data owner) for each scope.

Data integrity goes further: it guarantees that data hasn't been altered (intentionally or not) through its various transformations. When a piece of data moves from a delivery tool to a reporting tool, then to an executive dashboard, each transformation is a potential point of corruption. Integrity means that chain is documented, controlled, and auditable.

Data traceability (or data lineage) is the ability to reconstruct a data point's journey from its source to its point of use. Regulations now require it (AI Act, GDPR), but its operational value is just as significant: when a figure gets challenged in a steering committee, lineage lets you resolve the dispute in seconds rather than three days of consolidation work.

These three principles don't work in isolation. Traceable data that's poor quality is still unusable. Quality data without documented lineage can't be defended from a regulatory standpoint. It's the combination of all three that forms the foundation of an operational governance policy.

The role of the Chief Data Officer (and the limits of what that role can carry alone)

The CDO embodies information governance at the leadership level. They define the data strategy, drive quality and compliance programs, arbitrate priorities across data domains, and champion a data culture across the organization. In large organizations, their legitimacy has grown significantly with the pressure of regulation and the rise of AI.

But the CDO can't carry everything. Their effectiveness depends directly on the quality of the structures feeding them information (data stewards, data owners, business teams…) and on the organization's ability to align its practices around a shared framework. Without collective adoption, the CDO has a strategy with nothing to grip. They govern what they can see. And in a fragmented IS, what they see is rarely the full picture.

That's precisely where the role hits its structural ceiling. Data governance carried by a single function, however well-resourced, remains partial governance. It doesn't naturally connect to architecture decisions, portfolio arbitrations, or delivery choices. To become a real management lever, it has to work in concert with the other dimensions of the IS. And that's where enterprise architecture comes in.

"What I see in many organizations: the CDO has a strategy. Teams have their tools. And in between, there's a gap. Nobody governs the dependencies. Nobody sees the IS as it actually is. That's a colossal waste of talent." - Eric Draperi, Smoteo cofounder

Enterprise Architecture as the backbone of data governance

The Enterprise Architect has a vantage point few others in the IT organization share: interdependencies. They see how applications connect, how data flows between systems, how a local technical choice creates an invisible dependency downstream. That position makes them a natural actor in data governance… as long as their work doesn't stay buried in a SharePoint.

The recurring problem is well known: architecture maps get produced, documented, and then go stale before anyone actually uses them. The IS evolves faster than the reference systems meant to describe it. The Enterprise Architecture Lead ends up reconstructing the current state rather than anticipating future trajectories. And their impact analyses, critical as they are to major decisions, don't make it into the room at the right moment.

Effective data governance changes that equation. It gives enterprise architecture the means to work from a living picture of the Information System, not a static snapshot. Data flows are mapped in connection with business processes and ongoing initiatives. Dependencies are visible before projects create them. And architecture decisions are grounded in a shared reality, not manual extractions whose accuracy will be disputed in the next meeting.

Data governance that doesn't talk to architecture only governs part of what flows through the IS. Architecture that doesn't draw from data governance is working blind on its own reference systems. The two disciplines need each other. And their integration is precisely what most large organizations are missing.

Why AI agents are rewriting the rules of data governance

When agents consume and act on data without human supervision

For years, data governance was designed to manage human usage. Users access data, interpret it, make decisions. Quality rules, access rights, validation procedures — the entire framework assumed a human in the loop, capable of spotting an anomaly, questioning a figure, or refusing an inconsistent action.

AI agents fundamentally change that equation. An agent doesn't read data: it acts on it. It can query a CRM, cross-reference financial data, trigger a purchase order in an ERP, or modify an ITSM workflow, autonomously, at a speed and scale no human operator can supervise in real time.

And unlike a user who hesitates when something looks off, an agent amplifies inconsistencies at scale. Poor-quality data no longer produces a one-off error. It propagates through every automated decision that depends on it.

The problem isn't the agent itself. It's the speed at which it embeds itself in the IS before any framework is in place. Business teams run experiments. Leadership launches proofs of concept. Automations go into production without IT being in the loop. The CIO ends up in an uncomfortable position:

  • Govern too tightly, and you slow down a transformation the executive committee is expecting.

  • Govern too loosely, and you discover (after the fact) agents running in production on sensitive data, with unchecked permissions and no audit trail of the actions they've taken.

"We see it in organizations everywhere: traditional governance is structurally too slow. Agents get deployed by business teams on customer or financial data, without IT in the loop. Not out of bad intent. Out of speed." - Eric Draperi, Smoteo cofounder

The new requirements: lineage, action rights, traceability, sovereignty

Agentic AI forces a rethink of four dimensions of data governance that already existed, but were never designed for this level of autonomy.

Data lineage (where data comes from, how it's been transformed, in what context it's being used) becomes an operational requirement as much as a regulatory one. An agent making a decision must be able to explain which data it relied on, where that data originated, and whether it was appropriate for the use case. Without documented lineage, there's no way to audit an automated decision, and no way to detect a drift before it produces real consequences.

Action rights are the new frontier of governance. Defining who has access to what data is no longer enough. You need to define what an agent is allowed to do with that data (read, modify, trigger, delete), and under what conditions human validation is required. This is a granular, operation-level control logic that traditional governance frameworks never anticipated.

The traceability of each agent's actions (access, modification, trigger, timestamp, policy context) must be logged in a tamper-proof way. The AI Act requires this for high-risk systems. But beyond compliance, it's what enables a CIO to answer, in a steering committee, the question: what exactly are the agents deployed in our IS doing, and what data are they acting on?

Finally, data sovereignty takes on a new dimension. When an agent relies on a model hosted outside the European Union and processes customer or financial data, questions of data residency, exposure, and protection become matters of architecture as much as compliance.

Privacy and digital sovereignty are no longer peripheral concerns — they sit at the heart of any governance plan for organizations deploying agents at scale. Those that haven't anticipated this dimension are discovering contractual commitments made by their business teams with no prior risk analysis.

How Smoteo structures AI agent governance

As agents proliferate across the IS, Smoteo offers a supervision framework built around three complementary capabilities.

  • The AI Value Stream Map visually maps every agent deployed across the business and IT ecosystem: its function, the data it consumes, the processes it touches, the systems it interacts with.

  • The usage registry continuously tracks active AI initiatives by department — including those launched without formal approval — giving the CIO genuine visibility into what's actually running in the IS.

  • Smoteo's integrated AI agent governance frames each initiative through dedicated Epic Cards: objectives, dependencies, action rights, risks, and expected business value.

With Smoteo, agents move forward within a defined framework - not outside of it.

How to build data governance that's genuinely integrated with IT transformation

Anchoring governance in the living IS

The first mistake in most data management programs is architectural: they build reference systems alongside the IS rather than inside it. A manually maintained data catalog, a data flow map updated quarterly, a business glossary nobody consults before making a decision - these deliverables have documentary value, not operational value.

The problem isn't the rigor of the people producing them. It's that the IS evolves faster than the update cycles. A project launches, creates a new dependency between two systems, pulls sensitive data into an unmapped flow, and governance finds out three months later, during the next audit. The information exists. It's in Jira, in an email, in a Slack message. But it's never available at the moment a decision is being made.

Integrated data governance flips that logic:

  • It connects to existing delivery tools to continuously feed its reference systems — with no additional burden on teams.

  • Project statuses update automatically.

  • New application dependencies are detected at the scoping stage.

  • Emerging data flows are visible before they go into production.

This is no longer governance that documents what happened: it's governance that sees what's happening, and enables action before problems become incidents.

For the CIO Office, this shift is structural. It transforms how governance committees are prepared: instead of three days of manual consolidation to produce a partial summary, data is available in real time, from a single source everyone agrees on. With Smoteo, the resulting reduction in IT project scoping time can reach 30 to 50% — not because processes have changed, but because information no longer needs to be reconstructed from scratch each time.

Turning regulatory constraint into a steering framework

Regulatory compliance is typically treated as an external obligation. A set of requirements imposed by legislation, to be met with as little friction as possible. This posture produces one-off compliance programs, usually owned by the DPO or legal teams, disconnected from the actual IT transformation.

The current regulatory environment makes that approach unsustainable:

  • GDPR requires permanent data protection, particularly around personal data confidentiality.

  • The AI Act imposes documentation obligations on data feeding high-risk systems (origin, quality, absence of bias) along with audit logs of their decisions.

  • The Data Act redefines portability and sharing rights across scopes organizations hadn't anticipated.

These three frameworks don't apply sequentially - they coexist, overlap, and frequently concern the same data.

Organizations that have turned this constraint into a lever have adopted a different logic: rather than treating compliance as an after-the-fact check, they embed it as a native dimension of their data governance. Regulatory requirements become quality rules, scoping criteria, guardrails built into the delivery process. Compliance is no longer an audit: it's a permanent state, maintained by governance itself. This shift significantly reduces the risk of pre-production non-compliance, without adding verification overhead to operational teams.

It also changes the conversation with the executive committee. A CIO who can demonstrate that their AI initiatives are traceable, auditable, and compliant from the start isn't defending a budget. They're demonstrating control and trustworthiness. And in an environment where AI incidents are now reputational risks as much as compliance risks, that posture carries real strategic value.

What tools and practices does large-scale operational data governance require?

Data catalogs, data stewardship, data contracts

Three structural practices set apart organizations that have moved their data governance from framework to operation.

A data catalog is the centralized inventory of an organization's data assets: what exists, where it's stored, who owns it, how it's used, what quality rules apply. Built well, it's the common surface on which data, IT, and business teams can align. Built poorly (meaning: maintained manually, fed through periodic collection campaigns) it's outdated before it's useful. The key isn't the tool: it's the feeding model. A living catalog connects to source systems and updates continuously. A documentary catalog ages at the speed of the IS.

Data stewardship is the human organization that gives governance its substance. Data stewards (whether from the business side, IT, or both working in tandem as the most mature practices suggest) are the guardians of data quality and consistency within their domain. They identify anomalies, resolve definition conflicts, validate usage. Without them, governance remains a set of rules with no one to carry them. With them, it becomes a collective practice. The 2026 trend is clear: the most advanced organizations are no longer choosing between centralized and decentralized models. They operate in hybrid mode, with a central team setting the standards and business stewards embodying them in their own domains.

"A data steward without a living reference system is someone who spends their day chasing a reality that changes faster than their map. You're asking them to govern, without giving them the means to see what they're supposed to be governing." - Eric Draperi, Smoteo cofounder

Data contracts represent the next level of maturity. A data contract is a formal agreement between a data producer and its consumers: it defines the schema, quality standards, freshness SLAs, and conditions for change. In an IS where data flows across dozens of systems (and now between AI agents) this formalization becomes an operational necessity. It allows quality anomalies to be caught at the source rather than at the end of the chain, when the damage is already done.

From fixed framework to living model: data governance as an adaptive system

The structural limitation of most data governance programs isn't technical ; it's conceptual. These programs were designed as projects with an end date, producing a deliverable (a reference system, a framework, a policy) meant to hold over time. But data doesn't hold over time. Neither does the IS. And transformation, by definition, doesn't stop.

Large-scale operational data governance isn't a state to reach. It's a system that continuously adapts to the reality of the IS - to new projects, new dependencies, new AI use cases, new regulatory requirements.

That system needs three properties that traditional approaches don't guarantee:

  • Connection to reality: its reference systems feed from delivery platforms, not from manual data entry campaigns.

  • Cross-functional readability: its information is accessible and understandable to architects, PMOs, business stakeholders, and leadership - not just data teams.

  • Decision-making capacity: it doesn't just document what exists, it enables impact anticipation, initiative scoping, and objective arbitration.

This is precisely the difference between data governance that produces reporting and data governance that produces control. The first answers the question: what exists? The second answers: what's happening, what does it mean, and what needs to be decided right now?

"Data governance that produces reporting tells you what was true three days ago. Governance that produces control tells you what's true now, and what needs to be decided. They're not the same thing." - Eric Draperi, Smoteo cofounder

What's it like with Smoteo?

Smoteo makes this living governance logic concrete through a meta-model: a configurable governance model that connects strategy, architecture, portfolio, and delivery in a single surface.

Unlike a static reference system, Smoteo's meta-model feeds continuously from existing tools (Jira, CMDB platforms like ServiceNow…) without asking teams to change how they work. Every change in the IS is reflected in the model. Every new initiative is scoped in relation to existing dependencies and strategic objectives.

Smoteo's integrated portfolio orchestration then enables the full set of IT and AI initiatives to be managed as a coherent portfolio, not a collection of parallel projects. Arbitrations are grounded in a shared, reliable picture of the IS. The results? Mapped coverage of over 80% of applications, processes, and agents; 25 additional strategic decisions supported per quarter; and a 40% reduction in response time when unexpected events occur.

This is no longer data governance that documents the IS. It's data governance that drives it.

Data governance: what now?

Data governance has reached a tipping point. Organizations that invested in structured programs built solid foundations - but ones designed for a stable IS, consumed by humans, in a predictable regulatory environment. That context no longer exists.

Agentic AI has exposed the limits of governance processes built as documentary frameworks rather than management systems. The answer isn't to govern more, it's to govern differently.

Well-governed data doesn't just need to be reliable. It needs to make the organization capable of deciding fast, transforming without losing coherence, and deploying AI without losing control.

Are you leading IT transformation in a complex organization and looking to connect your data governance to your operational reality? Request your personalized Smoteo demo, and take control of your data - centralized, optimized, and always in sync.

Comparative Guide

PPM, BI, Agile, Delivery : Which Tool(s) Are Right for Your Governance?

About the Author

Eric Draperi

Cofounder @ Smoteo

I’ve spent most of my career making sense of complex information systems. I started out as an omnichannel architect, working with organizations facing a familiar challenge: connecting business and IT without sacrificing agility or clarity. I’ve been involved in multiple digital transformations, always driven by the same belief: an architecture only matters if it truly supports strategy and value creation.

Social Icon
About the Author

Eric Draperi

Cofounder @ Smoteo

I’ve spent most of my career making sense of complex information systems. I started out as an omnichannel architect, working with organizations facing a familiar challenge: connecting business and IT without sacrificing agility or clarity. I’ve been involved in multiple digital transformations, always driven by the same belief: an architecture only matters if it truly supports strategy and value creation.

Social Icon

Everyone Drives Change, Smoteo Connects the Dots

Whatever your role - CIO, Architect, PMO, or Product Owner - we've got your back

Everyone Drives Change, Smoteo Connects the Dots

Whatever your role - CIO, Architect, PMO, or Product Owner - we've got your back