Do you use data-diff?
What is data-diff?
Data Diff is an open-source package that can be run in a CLI or wrapped into any data orchestrator such as Airflow, Dagster, etc. Compare datasets quickly (seconds/minutes) at a large (millions/billions of rows) scale across different databases.
![data-diff media 1](https://ph-files.imgix.net/abdeb6ac-60c9-4189-b115-ccc43cc38123.jpeg?auto=compress&codec=mozjpeg&cs=strip&auto=format&w=256&h=160&fit=crop)
![data-diff media 2](https://ph-files.imgix.net/afdfc4b7-a4da-4dee-a020-1bdd94fbf26e.png?auto=compress&codec=mozjpeg&cs=strip&auto=format&w=256&h=160&fit=crop)
![data-diff media 3](https://ph-files.imgix.net/7c7ec7d1-45f6-4d0e-8c02-7a14029e631f.png?auto=compress&codec=mozjpeg&cs=strip&auto=format&w=256&h=160&fit=crop)
Recent launches
data-diff
Open source data-diff keeps getting better! 💫
In our latest release:
⏱ Faster diffing
🦆 DuckDB support!
✨ Store diff results
➕ and more!
Check out the full release notes here:
https://github.com/datafold/data-diff/releases/tag/v0.3.0
![data-diff image](https://ph-files.imgix.net/c30d8b1e-59ef-405b-a27f-abac875e121c.png?auto=compress&codec=mozjpeg&cs=strip&auto=format&w=150&h=90&fit=crop)
data-diff
Data Diff is an open-source package that can be run in a CLI or wrapped into any data orchestrator such as Airflow, Dagster, etc. Compare datasets quickly (seconds/minutes) at a large (millions/billions of rows) scale across different databases.
![data-diff image](https://ph-files.imgix.net/afdfc4b7-a4da-4dee-a020-1bdd94fbf26e.png?auto=compress&codec=mozjpeg&cs=strip&auto=format&w=150&h=90&fit=crop)