Skip to content

fix: bound automatic schema extraction input#543

Open
nkgotcode wants to merge 1 commit into
neo4j:mainfrom
nkgotcode:fix/schema-extraction-chunk-input
Open

fix: bound automatic schema extraction input#543
nkgotcode wants to merge 1 commit into
neo4j:mainfrom
nkgotcode:fix/schema-extraction-chunk-input

Conversation

@nkgotcode

Copy link
Copy Markdown

Description

Fixes #457.

This updates automatic schema extraction in the simple KG pipeline so schema extraction consumes splitter chunks instead of the full loader input. It also lets schema extraction accept chunk-list input and uses the first chunk for the schema prompt.

Type of Change

  • Bug fix
  • N/A: New feature
  • N/A: Breaking change
  • N/A: Documentation update
  • N/A: Project configuration change

Complexity

Complexity: Low

How Has This Been Tested?

  • Unit tests
  • N/A: E2E tests
  • N/A: Manual tests

uv run pytest tests/unit/experimental/pipeline/config/template_pipeline/test_simple_kg_builder.py tests/unit/experimental/components/test_schema.py -q

uv run ruff check src/neo4j_graphrag/experimental/components/schema.py src/neo4j_graphrag/experimental/pipeline/config/template_pipeline/simple_kg_builder.py tests/unit/experimental/pipeline/config/template_pipeline/test_simple_kg_builder.py tests/unit/experimental/components/test_schema.py

uv run ruff format --check src/neo4j_graphrag/experimental/components/schema.py src/neo4j_graphrag/experimental/pipeline/config/template_pipeline/simple_kg_builder.py tests/unit/experimental/pipeline/config/template_pipeline/test_simple_kg_builder.py tests/unit/experimental/components/test_schema.py

git diff --check

Checklist

The following requirements should have been met (depending on the changes in the branch):

  • Unit tests have been updated
  • CLA (https://neo4j.com/developer/cla/) has been signed
  • CHANGELOG.md updated if appropriate
  • N/A: Documentation has been updated
  • N/A: E2E tests have been updated
  • N/A: Examples have been updated
  • N/A: New files have copyright header

@nkgotcode nkgotcode requested a review from a team as a code owner June 16, 2026 01:41
@NathalieCharbel NathalieCharbel self-requested a review June 25, 2026 12:42
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[BUG]: schema extraction uses the whole file without splitting in chunks

1 participant