Best Data open-source projects
Data processing and database tools
Top Data repositories
- 1Turn any folder of code into a queryable knowledge graph instantlyTurn any folder of code into a queryable knowledge graph instantly74,9787,453
- 2Turn any GitHub repository into an interactive knowledge graph instantlyTurn any GitHub repository into an interactive knowledge graph instantly43,3594,843
- 3umami-software/umamiBy @the_osps • Modern, privacy-focused alternative to Google Analytics.37,4207,414
- 4One database that's relational, document, graph, and time-series at once.One database that's relational, document, graph, and time-series at once.32,5671,291
- 5HumanSignal/label-studioBy @the_osps • Label Studio is a multi-type data labeling and annotation tool with standardized...27,7173,606
- 6The open-source observability platform that replaces DataDog and NewRelic.The open-source observability platform that replaces DataDog and NewRelic.27,5392,268
- 7Manage your entire budget with a tool that never sends data to the cloudManage your entire budget with a tool that never sends data to the cloud27,2982,625
- 8sinaptik-ai/pandas-aiBy @the_osps • Chat with your database or your datalake (SQL, CSV, parquet).23,6232,334
- 9Your window into all of your dataYour window into all of your data21,1658,596
- 10SQLModel combines Pydantic and SQLAlchemy to cut code duplication.SQLModel combines Pydantic and SQLAlchemy to cut code duplication.18,156867
- 11DataxDatax17,2545,654
- 12QuestDB: an open-source time-series database with SIMD-accelerated SQL and multi...QuestDB: an open-source time-series database with SIMD-accelerated SQL and multi...17,1561,615
- 13googleapis/genai-toolboxBy @the_osps • MCP Toolbox: Open source MCP server for databases.15,7701,621
- 14apache/dorisBy @the_osps • Easy-to-use, high performance and unified analytics database15,5623,851
- 15Tiny RDM: a lightweight cross-platform Redis desktop manager with SSH tunnel and...Tiny RDM: a lightweight cross-platform Redis desktop manager with SSH tunnel and...12,927650
- 16The open-source alternative to Google Analytics you can self-hostThe open-source alternative to Google Analytics you can self-host12,377681
- 17redpanda-data/redpandaBy @the_osps • Redpanda is a streaming data platform for developers.12,279759
- 18The open-source API gateway for REST GraphQL TCP and gRPC trafficThe open-source API gateway for REST GraphQL TCP and gRPC traffic10,7551,157
- 19Your personal intelligence agent. Watches the world from multiple data sources a...Your personal intelligence agent. Watches the world from multiple data sources a...10,3841,648
- 20Extract structured data from diverse document types and languagesExtract structured data from diverse document types and languages8,970802
Recently discovered
- 1Krishnagangwal/CS-FundamentalsCurated CS fundamentals for placement prep: DSA,Computer Networks, DBMS & SQL, OOPs, Operating Systems, System Design & Software Engineering1,460120
- 2kishorekarthck/Discord_Server_Backup___Archive_ToolBackup entire Discord servers: save channels, messages, files, and member list. Archive your community data securely.259
- 3Tiny RDM: a lightweight cross-platform Redis desktop manager with SSH tunnel and...Tiny RDM: a lightweight cross-platform Redis desktop manager with SSH tunnel and...12,927650
- 4QuestDB: an open-source time-series database with SIMD-accelerated SQL and multi...QuestDB: an open-source time-series database with SIMD-accelerated SQL and multi...17,1561,615
- 5RAGLite: lightweight RAG with DuckDB or PostgreSQL and late chunkingRAGLite: lightweight RAG with DuckDB or PostgreSQL and late chunking1,192107
- 6Databunker: self-hosted tokenization for PII that actually encrypts at API levelDatabunker: self-hosted tokenization for PII that actually encrypts at API level1,47194
- 7Renders ADS-B data at 60 fps by tweening between ~1 Hz fixes with no teleporting...Renders ADS-B data at 60 fps by tweening between ~1 Hz fixes with no teleporting...2,970355
- 8SQLModel combines Pydantic and SQLAlchemy to cut code duplication.SQLModel combines Pydantic and SQLAlchemy to cut code duplication.18,156867