Skip to content

MCP Data Platform Ecosystem

mcp-s3 is part of a broader suite of open-source MCP servers designed to work together as a composable data platform. Each component can run standalone or be combined to give AI assistants unified access to storage, query engines, and metadata catalogs.


mcp-data-platform

The orchestration layer that ties the ecosystem together. mcp-data-platform provides a single MCP server that coordinates access across S3 storage, Trino query engines, and DataHub metadata catalogs. Rather than configuring each MCP server independently, mcp-data-platform presents a unified interface where AI assistants can discover datasets through the catalog, query them through Trino, and access the underlying files in S3, all from one connection. It handles connection routing, credential management, and cross-service context so that assistants can work with data end-to-end without switching between tools.

mcp-datahub

An MCP server for DataHub, the open-source metadata platform. mcp-datahub lets AI assistants search and browse the data catalog, inspect dataset schemas, trace column-level lineage, and look up business glossary terms. When paired with mcp-trino and mcp-s3, it provides the discovery layer: assistants can find the right dataset by name or description, understand its structure and ownership, then seamlessly query or retrieve the data. It supports tags, domains, data products, and quality scores, giving assistants the context they need to work with data responsibly.

mcp-trino

An MCP server for Trino, the distributed SQL query engine. mcp-trino enables AI assistants to run read-only SQL queries across any data source that Trino connects to, including data lakes, warehouses, and relational databases. Assistants can list catalogs and schemas, describe tables, explain query plans, and execute analytical queries with configurable timeouts and row limits. Combined with mcp-datahub for discovery and mcp-s3 for raw file access, mcp-trino completes the platform by providing the structured query interface that turns raw data into answers.