Announcing Sphynx - A Crowdfunding Project

Sphynx, as you may know, is my version of a data catalog, with a twist: it was designed to support any data source without the need for a data lake or a data warehouse. I created the Sphynx project to implement a navigational architecture for metadata called ANA - Analytic Navigational Architecture - an automation framework to manage enterprise data in any format or location for a variety of applications, from core data architecture to AI, Data Science, and Analytics.

ANA - or Sphynx - was created to be the primary interface for anyone in a company wanting to work on data-related projects. With this tool you can: 

  1. Map your data sources, regardless of where they sit - Clouds, ERP, Legacy, Big Data;
  2. Create and run federated queries over existing data sources;
  3. Define data sets for use in AI, reporting and Data Science projects;
  4. Document and classify your metadata;
  5. Generate quick access to reporting, AI and Analytics applications;
  6. Quickly locate any data asset in a corporation;
  7. Identify deviations from standards;
  8. Provide and manage data quality patterns in the catalog;
  9. Manage Master data from the source;
  10. Provide quick integration across data sources without ETL or streaming (planned for version 2 onwards).

You can also check your assets against conceptual definitions: find what is being designed, what has been created, and associate a concept with physical data.

In summary, Sphynx performs most of the actions demanded of a data architecture department, automating the process and adding flexibility, while delivering much faster, enforcing data quality standards, and saving time and costs.

Sphynx consists of only a client and a database server to store metadata. Initially designed for Windows, it will soon be ported to Linux and macOS.

Sphynx does not rely on heavy servers or make heavy use of resources - it is a heavy client designed to be fast, real fast.

The clients are of two types - the Management console and the User console. The first performs data ingestion and classification, while the second provides navigational features and artifact creation in the catalog.

The catalog repository was created in SQL Server for its multiple capabilities and high flexibility, which makes it a perfect tool for storing and managing metadata. Other versions will support Postgres/Greenplum, Oracle, and Teradata. 

Although targeted at SQL databases, the technology behind Sphynx will allow capturing metadata from any other source, like Text, Hadoop, Cassandra, Excel, and more, through the use of add-ins.
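To make the add-in idea a little more concrete, here is a minimal sketch in Python. It is not Sphynx code: the `MetadataAddin` contract, the class names, and the catalog row layout are all assumptions made up for this illustration, and SQLite stands in for a real SQL source. It only shows how a SQL source and a text source could feed the same catalog through one common interface.

```python
import csv
import io
import sqlite3
from typing import Protocol

# One catalog row: (source name, table name, column name, column type).
# This layout is an assumption for the sketch, not the Sphynx repository schema.
CatalogRow = tuple[str, str, str, str]


class MetadataAddin(Protocol):
    """Hypothetical add-in contract: each add-in returns catalog rows."""

    def capture(self) -> list[CatalogRow]: ...


class SqliteAddin:
    """Reads table and column metadata from a SQLite connection."""

    def __init__(self, name: str, conn: sqlite3.Connection) -> None:
        self.name = name
        self.conn = conn

    def capture(self) -> list[CatalogRow]:
        rows: list[CatalogRow] = []
        tables = self.conn.execute(
            "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
        ).fetchall()
        for (table,) in tables:
            # PRAGMA table_info yields (cid, name, type, notnull, dflt_value, pk).
            for _cid, column, col_type, *_rest in self.conn.execute(
                f'PRAGMA table_info("{table}")'
            ):
                rows.append((self.name, table, column, col_type))
        return rows


class CsvTextAddin:
    """Treats the header line of a CSV text as the column list."""

    def __init__(self, name: str, table: str, text: str) -> None:
        self.name = name
        self.table = table
        self.text = text

    def capture(self) -> list[CatalogRow]:
        header = next(csv.reader(io.StringIO(self.text)))
        # CSV carries no type information, so every column is recorded as TEXT.
        return [(self.name, self.table, column, "TEXT") for column in header]


def build_catalog(addins: list[MetadataAddin]) -> list[CatalogRow]:
    """Collects metadata from every add-in into one flat catalog."""
    catalog: list[CatalogRow] = []
    for addin in addins:
        catalog.extend(addin.capture())
    return catalog


# Demo data: an in-memory database and a small CSV text.
erp = sqlite3.connect(":memory:")
erp.execute("CREATE TABLE customer (id INTEGER, name TEXT)")
orders_csv = "order_id,customer_id,amount\n1,1,9.5\n"

catalog = build_catalog(
    [SqliteAddin("erp", erp), CsvTextAddin("files", "orders", orders_csv)]
)
```

The point of the design is that the catalog never needs to know what kind of source it is reading; supporting a new source type means writing one more class with a `capture` method.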

High Speed

The latest incarnation of Sphynx, to be deployed by the end of November, will feature full federation, using Edge computing to increase speed - the same concept will be applied to all platforms considered. It will allow a user to create queries over the catalog and run them against any database or combination of databases using a single piece of code, without expensive Data Lakes which bloat your enterprise governance.
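For readers who have not met federation before, the idea can be sketched in a few lines of Python. This is only an illustration under loud assumptions: two in-memory SQLite databases play the role of independent sources, and `make_source` and `run_federated` are hypothetical helpers invented here, not part of Sphynx. One query text is sent to each source where the data lives, and only the small partial results are combined on the client.

```python
import sqlite3


def make_source(rows):
    """Builds a stand-in data source: an in-memory database with a sales table."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE sales (region TEXT, amount REAL)")
    conn.executemany("INSERT INTO sales VALUES (?, ?)", rows)
    return conn


def run_federated(query, sources):
    """Runs the same query on every source and tags each row with its source name."""
    combined = []
    for name, conn in sources.items():
        for row in conn.execute(query):
            combined.append((name, *row))
    return combined


# Two independent sources; nothing is copied into a central store.
sources = {
    "erp": make_source([("north", 100.0), ("south", 50.0)]),
    "legacy": make_source([("north", 25.0)]),
}

# A single query definition, executed at each source.
QUERY = "SELECT region, SUM(amount) FROM sales GROUP BY region ORDER BY region"
partials = run_federated(QUERY, sources)

# Client-side merge of the per-source partial sums.
totals = {}
for _source, region, amount in partials:
    totals[region] = totals.get(region, 0.0) + amount
```

Each source does its own aggregation, so only a handful of rows travel to the client. Summing partial sums works here because SUM can be combined across sources; other aggregates, such as averages, would need a different merge step.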

Data Residing Anywhere

If you recorded a data source using Sphynx, you can view your data at the source location, without the need for a Data Lake or Data Warehouse to centralize data - fast, light, and easy to use. All you need is a valid connection.

Crowdfunding Mode

If you are interested in the project and would like to contribute, either by supporting it financially or by contributing to the code base, please send me a note at macr2011@gmail.com. You can find the contribution button at the top right of the Blog. Support our project and get a free copy of my book "Modern Data Architecture" - a different view of data architecture in simple steps.

Why use Sphynx 

Companies can automate their data governance and architecture with this powerful tool in record time, saving thousands of hours of work, precious hours of design, and having secure access to corporate data from a single place. No more wasting millions to manage your data. Implement modern data architecture at a fraction of the cost.


