Show HN: GitHub-assistant – Natural language questions from your GitHub data

github.com

40 points by aazo11 16 hours ago

Simon(sfarshid) and I spend a lot of time on GitHub. As data nerds we put together a quick tool to explore your repository’s data.

How it works:

- Data Loading: We use dlt to pull data (issues, PRs, commits, stars) from GitHub

- Semantic Layer: Relta wraps the underlying dataset into a semantic layer so the LLM doesn’t hallucinate.

- Text-to-SQL: A text-to-SQL agent transforms your plain-English question into a query using the semantic layer

- Generative Charts: assistant-ui dynamically generates a chart based on the SQL query

- Refinements: If the semantic layer can’t handle your question, our agent submits semantic layer improvements via pull requests

Hosted version: https://github-assistant.com

Demo Video: https://youtu.be/ATaf98nID5c

Check out the repo + hosted version and let us know what you think.

diggan 7 hours ago

Is there any information you can get out of this that isn't already available in the GitHub.com UI? I tried asking things like "What could the most interesting information you can tell me about this repository?" but it seems like most of the data is already available in the UI in pretty much the same format, except you click a link to see it instead of writing a question and waiting for a reply.

  • eddywebs 6 hours ago

    Same thoughts

    • aazo11 5 hours ago

      We pull data from the GitHub API which includes data that that is not available from GitHub.com pages. Currently only PR, Issues, Commit and Star data is being loaded. You can also read more here https://medium.com/relta/github-assistant-49ae388ad758

      • aazo11 4 hours ago

        There will new data from the graphql API added over time. Would love your feedback on which data you like to see added https://docs.github.com/en/graphql

        • diggan 18 minutes ago

          Maybe a better question: What questions could be answered with your service, that could not be answered with just cURL + Git + the GitHub API?

bberenberg 8 hours ago

Pretty nifty, is Relta going to be OSS as well?

croemer 16 hours ago

Tried adding a repo I work on, import worked after failing first, but then the query result was that there was no data on top contributors.

Put the video in the Readme!

  • aazo11 15 hours ago

    Hi -- strange that didn't work. Overall, the semantic layer is designed to provide very tight guardrails and not hallucinate. You can see the agent suggest changes to the semantic layer if you give the produced answer a thumbs down.

    The idea is for the system to provide answers that have close to 100% accuracy, but make it a single click for developers to to improve the semantic layer.

  • aazo11 12 hours ago

    Was able to reproduce and pushed an update. Thanks for calling this out.

  • sfarshid 16 hours ago

    Just updated the README, thanks for the suggestion!

eddywebs 6 hours ago

Can this help in explaining how the code works, its schematics or HLD of a given github repo ?

  • aazo11 5 hours ago

    No this currently only answers questions from the GitHub graphql API.