InternScience/GraphGen

GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation

2 Releases
v0.1.0.post20250930 (Latest)
ChenZiHong-Gavin · September 30, 2025

📋 What's Changed

  • typo(app.py): update arXiv badge by @tpoisonooo in https://github.com/open-sciencelab/GraphGen/pull/11
  • Add citation by @ChenZiHong-Gavin in https://github.com/open-sciencelab/GraphGen/pull/12
  • fix: fix parameter passing error & add logger by @ChenZiHong-Gavin in https://github.com/open-sciencelab/GraphGen/pull/14
  • refactor(graphgen): update imports and adapt GraphGen class by @Tendo33 in https://github.com/open-sciencelab/GraphGen/pull/15
  • feat(webui): support M_synth & M_train from different providers by @ChenZiHong-Gavin in https://github.com/open-sciencelab/GraphGen/pull/20
  • Added the Dockerfile for the GraphGen by @Nick1200000 in https://github.com/open-sciencelab/GraphGen/pull/24
  • add hugging face in readme by @royal-dargon in https://github.com/open-sciencelab/GraphGen/pull/32
  • typo(webui): renaming `SiliconCloud` to `SiliconFlow` by @tpoisonooo in https://github.com/open-sciencelab/GraphGen/pull/34
  • + 17 more

New Contributors

  • @Tendo33 made their first contribution in https://github.com/open-sciencelab/GraphGen/pull/15
  • @Nick1200000 made their first contribution in https://github.com/open-sciencelab/GraphGen/pull/24
  • @royal-dargon made their first contribution in https://github.com/open-sciencelab/GraphGen/pull/32

Full Changelog: https://github.com/open-sciencelab/GraphGen/compare/20250422...v0.1.0.post20250930

20250422 (Pre-release)
tpoisonooo · April 22, 2025

📋 What's Changed

  • GraphGen is a framework for synthetic data generation guided by knowledge graphs, designed to address the challenges of knowledge-intensive QA generation.
  • By uploading your text chunks (for example, domain knowledge in agriculture, healthcare, or marine science) and providing an LLM API key, you can generate online the training data required by [LLaMA-Factory](https://github.com/hiyouga/LLaMA-Factory) and [xtuner](https://github.com/InternLM/xtuner).
  • We automatically delete user information once generation completes.
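
To give a rough idea of what "training data required by LLaMA-Factory" can look like, here is a minimal sketch that turns question/answer pairs into Alpaca-style records (`instruction`, `input`, `output`), one of the dataset layouts LLaMA-Factory accepts. This is not GraphGen's own export code, and its real output schema may differ. The helper `to_alpaca_records` and the sample pair are hypothetical and exist only for illustration.

```python
import json


def to_alpaca_records(qa_pairs):
    """Convert (question, answer) pairs into Alpaca-style dicts.

    Hypothetical helper for illustration; not part of GraphGen.
    """
    return [
        {"instruction": question.strip(), "input": "", "output": answer.strip()}
        for question, answer in qa_pairs
    ]


# Made-up sample pair standing in for generated QA data.
qa_pairs = [
    (" What does crop rotation help prevent? ",
     "It helps prevent depletion of soil nutrients."),
]

records = to_alpaca_records(qa_pairs)

# Serialize as a JSON array, the usual on-disk form for Alpaca-style datasets.
print(json.dumps(records, ensure_ascii=False, indent=2))
```

Leaving `input` empty is the usual choice when a question carries all of its own context.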

New Contributors

  • @tpoisonooo made their first contribution in https://github.com/open-sciencelab/GraphGen/pull/1
  • @ChenZiHong-Gavin made their first contribution in https://github.com/open-sciencelab/GraphGen/pull/8

Full Changelog: https://github.com/open-sciencelab/GraphGen/commits/20250422