Task: RAG
Release Date: 6/22/2026
Format: Parquet
Size: 587.12 MB
Share
Parquet export of the Stack2Graph Qdrant vector dataset for the Perl programming language. The archive contains dense/sparse vector shards and manifests.
Licensing
Creative Commons Attribution Share Alike 4.0 International (CC-BY-SA-4.0)
https://spdx.org/licenses/CC-BY-SA-4.0.htmlRestrictions/Special Constraints
Use must comply with the source StackOverflow content license and attribution requirements.
Forbidden Usage
Do not use in ways that violate the source content license, privacy expectations, or applicable law.
Ethical Review
Built from publicly released StackOverflow data dumps; users remain responsible for compliant usage.
Intended Use
Semantic retrieval, KG entry-point finding, RAG experiments, and vector search research.
Generated from StackOverflow XML dumps into per-language dense/sparse vector parquet shards.