Yardi Data Architecture for AI: Data Connect, Replicate & YDMS Explained | Assetsoft

09.05.26 01:59 AM Comment(s) By Assetsoft

How Data Connect, Replicate, and YDMS Build the Foundation Virtuoso Runs On

31,000+

Microsoft Fabric customer organizations

31,000+

Microsoft Fabric customer organizations

31,000+

Microsoft Fabric customer organizations

The numbers above describe a market that has already made its decision. Microsoft Fabric is no longer an early-adopter platform; it is the substrate on which most enterprise AI workloads will be built. For the real estate sector specifically, that decision has direct consequences for how Yardi data is structured, surfaced, and made available to the next generation of AI tools.

At the Yardi Executive Briefing in Santa Barbara during the week of May 4, 2026, a Yardi data strategy session led by Nagesh Nagaraj walked consultants through the three Yardi products that operationalize this reality: Yardi Data Connect, Yardi Replicate, and Yardi Document Management for SharePoint. The companion pillar post in this series introduced the framework that organizes them. This deep dive examines the architecture beneath each.

Three Products, Three Architectures, One Foundation

The three Yardi data and content products serve different downstream environments and operate under different constraints. The table below summarizes how they actually work, not how they are marketed.

Yardi product	What it actually does	Where it lives
Yardi Data Connect	Query-driven API exposing Voyager data through SQL-style scripted reports into Power BI dataflows and a semantic model.	Microsoft Power BI / Microsoft Fabric — no private cloud required
Yardi Replicate	Real-time Change Data Capture of raw Voyager tables, powered by Qlik and Microsoft SQL CDC	Snowflake, AWS, or Azure (per Yardi) — Yardi private cloud only
Yardi Document Management for SharePoint	Replaces Yardi's proprietary document repository with SharePoint Online; auto-tags Voyager metadata; inherits Yardi security	Microsoft 365 / SharePoint Online

The pillar post covers when to choose which. The sections below cover how each is engineered.

Yardi Data Connect: the SQL-report-to-Power-BI pipeline

Most consultants engaging with YDC have never traced the actual data path from Voyager through to a published Power BI dashboard. The path matters because it determines what is possible at each layer and where governance has to be established.

The pipeline

YDC begins at the Yardi live database, the same database that Voyager and its ancillary products (leasing, accounting, maintenance, the Cafe portals) read and write to during normal operation. From that database, YDC uses SQL-style scripted reports as its data-extraction primitive. This is the same construct as YSR reports, which means existing standard reports can be repointed through YDC rather than rewritten.

Reports flow through an API user with administration controls applied. The API user is the governance boundary; access scope, refresh cadence, and audit are all controlled at this layer. From there, data lands in Power BI dataflows, which feed a semantic model. The model publishes as visual reports, paginated reports, and dashboards. Other connectors, Salesforce, finance systems, files, and applications can be added to the same workspace and joined into the same model.

The security model is extended, not just inherited.

Per Yardi's product documentation, YDC honors Voyager's property-level security and adds row-level and workspace-level protections in Power BI. The Voyager security model is preserved, and the Microsoft layer adds finer-grained controls on top of it. For multi-portfolio operators with overlapping access requirements, this two-layer model is materially different from a generic data-extract-and-transform approach.

What this enables at the AI layer

Once data sits in the Microsoft Power BI / Fabric workspace, the broader Microsoft AI stack becomes available against the same model: Microsoft Copilot, Power Automate, machine learning workloads, and Fabric data agents. The semantic model built for dashboards serves as the grounding layer for AI agents, the same artifact used twice.

Yardi Replicate: real-time CDC for non-Microsoft warehouses

Replicate exists for a specific class of operators: those who have already invested in Snowflake, AWS, or Microsoft Azure and want raw Voyager data within those environments without rebuilding their analytics stack on Microsoft.

How it works

Per Yardi's official product documentation, Replicate is powered by Qlik and Microsoft Change Data Capture. The Qlik replication engine, combined with Microsoft SQL CDC, detects row-level changes in Voyager and streams them in near-real time to the destination warehouse. After an initial bulk load, only changed records flow downstream.

Operational characteristics

• First-load throughput is large by design; billions of rows can be moved within a small number of hours, depending on portfolio size

• Steady-state load is low, only deltas are streamed, which means no nightly batch window and minimal production impact

• Replication tasks are configured per-table with control over schedule, target endpoint, and operational mode (real-time, batch, or hybrid)

• Tasks can be started, paused, or stopped; logs capture lag, errors, and exception conditions

Supported endpoints

Per Yardi's official product documentation, Yardi Replicate works with Snowflake, Amazon Web Services (AWS), and Microsoft Azure. The underlying Qlik replication engine supports a broader range of targets in other Qlik deployments. Still, any target outside Yardi's published list should be scoped directly with Yardi before client commitments are made.

Replicate is available exclusively to Yardi private cloud clients. Operators on standard hosting cannot deploy it.

Why ontology, not just BI, is the AI unlock

In the Yardi data strategy session, the segment that drew the most attention from consultants in the room was on ontology, a concept few of them had encountered in the context of real estate technology. The argument is straightforward and consequential.

Tabular models describe what data is. Ontologies describe what data means.

A relational schema specifies that a tenant table joins a lease table via a foreign key. It does not tell a Large Language Model that anchor tenant implies a specific lease type with concentration-risk implications, that lease termination triggers downstream GL entries and audit obligations, or that CAM reconciliation is a year-end calculation governed by lease-specific clauses. Ontologies encode that semantic context.

Microsoft Fabric IQ Ontology: the practical implementation

Microsoft has implemented this concept in Fabric IQ Ontology (preview), part of the Fabric IQ workload. Per Microsoft Learn, the implementation flow is:

1. Build (or import) a Power BI semantic model

2. Generate an Ontology item from the semantic model, or build directly from OneLake-backed lakehouse tables

3. Bind entity types (Property, Lease, Tenant, GLEntry, etc.) to data, define properties, and configure relationship types

4. Once bound, downstream tools, Fabric data agents, Copilot, and custom AI agents query the ontology in natural language and return answers grounded in the underlying data

Microsoft documents specific constraints that should be flagged with clients: ontology supports managed lakehouse tables only (not external tables), and there are column-mapping limitations on delta tables with certain special characters in column names. The feature is in preview, not general availability.

The strategic implication for Yardi clients

Every Power BI semantic model built on YDC today serves a dual purpose: as an operational reporting deliverable and as the substrate for a future ontology. The relationships, calculated measures, and security boundaries baked into the semantic model for a quarterly NOI dashboard are the same context an LLM will require to answer a Copilot question about that NOI six months later. Skipping the modeling discipline today does not just produce an inferior dashboard. It produces an inferior AI.

YDMS: the metadata mechanics behind document AI

Document AI in real estate fails for the same reason most real-estate AI fails: the underlying content estate is fragmented, untagged, and ungoverned. YDMS addresses this at the architectural layer rather than the application layer.

The handshake

Microsoft 365 enforces multi-factor authentication on every interaction. YDMS includes a designed authentication handshake that completes the auth flow once and maintains the session, allowing users navigating from Voyager into a SharePoint-stored document to bypass repeated re-authentication. Without this handshake, document access becomes a friction point that drives users back to email attachments and shared drives.

Metadata stamping

Per Yardi's official YDMS brochure, every document automatically inherits Voyager-side context: property code, unit, tenant, lease, vendor, job, and other object types. OCR runs on stored documents; full-text indexing makes them searchable by tenant name, address, system code, or content. For multi-jurisdiction operators (Canada, Australia, the Middle East, Europe, the United States), localization settings flow through automatically.

Security stamping

Yardi user roles, property-level restrictions, and object-level permissions push to SharePoint sites, libraries, and folders from Voyager. The same model governs access in Voyager and access in SharePoint. A user without visibility to a property in Voyager cannot see that property's documents in SharePoint.

The compliance dividend

SharePoint sits inside the Microsoft 365 audit log, which records every document interaction: view, edit, share, search. For operators in regulated jurisdictions, including Canada (PIPEDA), the European Union (GDPR), Australia (Privacy Act), and the United Arab Emirates (data residency requirements), the audit log produced by YDMS is the difference between defensible compliance and a remediation project.

Where consulting engagements stall and how to scope them correctly

Each of the three Yardi data products carries a distinct engagement profile. Treating them as interchangeable is the most common scoping error. The categorization below reflects how engagements typically shape up across the Assetsoft delivery experience and is intended as a planning guide rather than a fixed quote.

YDC engagements

Centered on semantic model design, DAX expression authoring, dashboard craft, and Copilot grounding. Smaller delivery teams, design-led typical duration of 6 to 12 weeks for a defined-scope deliverable. The most common pitfall is treating the dashboard as the goal rather than the semantic model. The dashboard is replaceable; the model compounds in value.

Replicate engagements

Centered on lakehouse architecture, schema-on-read patterns, downstream security configuration, and data engineering rigor. Larger delivery teams, architecture-led, with a typical duration of 3 to 6 months for the initial implementation, with ongoing engineering activity afterwards. The most common pitfall is underestimating the volume of downstream work. The warehouse landing zone is the start of the engagement, not the end.

YDMS engagements

Centered on SharePoint information architecture, retention policy configuration, eSignature workflow design, RPA via Power Automate, and SPFx custom application development. Hybrid delivery teams bridging IT and operations, with a typical duration of 8 to 16 weeks for the initial deployment, with ongoing optimization activity. The most common pitfall is treating YDMS as a storage migration rather than as a content governance project.

Where Assetsoft delivers in the Yardi data and content stack

Assetsoft has implemented Yardi data and content architecture for property management, real estate investment, and construction operators since 2012, with delivery teams across Canada, India, Sri Lanka, and the United States serving clients in Canada, the United States, Australia, New Zealand, Singapore, the United Arab Emirates, and broader APAC. Where mixed ERP estates exist, Yardi, alongsideMRI, Procore, ViewPoint, CMiC, or SAP Concur Assetsoft's agreement with KriyaGo, provides 100+ pre-built integrations to close the cross-platform automation gap that pure Yardi consulting cannot reach.

Frequently asked questions

What is the difference between Yardi Data Connect and Yardi Replicate?

Yardi Data Connect (YDC) is a query-driven API that exposes Voyager data into Microsoft Power BI and Microsoft Fabric through a semantic model. Yardi Replicate is a real-time, table-level Change Data Capture stream that delivers raw Voyager data into Snowflake, AWS, or Azure (per Yardi's published support). YDC is the right fit for Microsoft-stack analytics and Copilot use cases. Replicate is the right fit for lakehouse and data-science workloads. Replicate requires Yardi private cloud; YDC does not.

Is Microsoft Fabric Ontology generally available?

As of early 2026, the Ontology item within Microsoft Fabric IQ is in preview, not general availability. Production rollouts should review Microsoft Learn's documented constraints, including the limitation to managed lakehouse tables and column-mapping restrictions on delta tables with special characters in column names.

Can YDC, Replicate, and YDMS be deployed together?

Yes. Mature architectures combine YDMS as the content layer, YDC as the BI and Copilot layer, and Replicate as the raw data layer that feeds a non-Microsoft warehouse for data science teams. The three products serve complementary purposes and are commonly deployed in stages over a 12- to 24-month roadmap.

Does YDMS require additional Microsoft licensing?

YDMS leverages a client's existing Microsoft 365 / SharePoint Online licensing. There is no separate Microsoft platform license to acquire, though Yardi's product fee for YDMS itself applies. Operators with established Microsoft 365 deployments typically experience YDMS as a metadata and security configuration project rather than an infrastructure procurement.

How does Yardi Virtuoso relate to YDC, Replicate, and YDMS?

Virtuoso is Yardi's native AI platform that operates within Voyager. The three data and content products extend AI capability outside Voyager into the Microsoft Power BI and Fabric ecosystem (via YDC), into client-owned data warehouses (via Replicate), and into the SharePoint document estate (via YDMS). Virtuoso and the ABC stack are complementary deployments, not competing platforms.

Ready to build your Yardi data and content architecture?

Assetsoft delivers Yardi Data Connect implementation, Yardi Replicate lakehouse architecture, and Yardi Document Management for SharePoint deployment for property management firms across North America, Australia, the United Kingdom, the Middle East, and APAC. Engagements begin with a structured assessment of the existing data and content estate.

→ Start with an architecture assessment atassetsoft.co

Get Started Now