Skip to content

Commit

Permalink
Transition Recap to a metadata gateway
Browse files Browse the repository at this point in the history
After flailing around a bit, I think I've figured out how to split Recap up.

I have created [Twister](https://twister.dev/), which is a Java project that
converts Avro/Proto to and from Java POJOs.

Separately, I have created the [Recap Type Spec](https://recap.build), which
defines how Recap models schemas.

This repo will now become the metadata gateway portion of Recap. It will wrap
metadata sources (data catalogs, DB information_schemas, data lake catalogs,
schema registries, etc) in a single shared API. This should allow data
engineers and infrastructure developers to build software that works with an
organization's stack, whether they use Datahub, Buf.build, Confluent schema
registry, Amundsen, Marquez, or all of the above.

Step one is an MVP that has:

1. Confluent schema registry + Avro support
2. SQLAlchemy support

The Recap type spec is used as the schema API.

Future work includes adding more integrations, hardening the existing
integrations (which are laughably incomplete), and adding support for lineage
using OpenLineage's API as the common format.

I might also add write support (not just read), so you can write metadata
without worrying about which catalog it's going to.
  • Loading branch information
criccomini committed Jun 1, 2023
1 parent eff394d commit 159e009
Show file tree
Hide file tree
Showing 30 changed files with 1,694 additions and 1,921 deletions.
40 changes: 38 additions & 2 deletions pdm.lock

Some generated files are not rendered by default. Learn more about how customized files appear on GitHub.

3 changes: 3 additions & 0 deletions pyproject.toml
Original file line number Diff line number Diff line change
Expand Up @@ -74,6 +74,9 @@ style = [
"pylint>=2.16.1",
"pyright>=1.1.293",
]
kafka = [
"confluent-kafka>=2.1.1",
]

[tool.isort]
profile = "black"
Expand Down
42 changes: 0 additions & 42 deletions recap/catalog/client.py

This file was deleted.

44 changes: 0 additions & 44 deletions recap/catalog/crawler.py

This file was deleted.

53 changes: 0 additions & 53 deletions recap/catalog/server.py

This file was deleted.

109 changes: 0 additions & 109 deletions recap/catalog/storage.py

This file was deleted.

49 changes: 0 additions & 49 deletions recap/cli.py

This file was deleted.

Loading

0 comments on commit 159e009

Please sign in to comment.