文件内容
skill-card.md
## Description: <br>
Set up and optimize RAG pipelines for large datasets with document chunking, embedding benchmarking, vector indexing, and retrieval tuning. <br>
This skill is ready for commercial/non-commercial use. <br>
## Publisher: <br>
[abhinas90](https://clawhub.ai/user/abhinas90) <br>
### License/Terms of Use: <br>
MIT-0 <br>
## Use Case: <br>
Developers and engineers use this skill to bootstrap and tune RAG pipelines over large document collections, including chunking choices, embedding comparisons, vector index operations, and retrieval parameter tuning. <br>
### Deployment Geography for Use: <br>
Global <br>
## Known Risks and Mitigations: <br>
Risk: Document chunks may contain sensitive data, and the security guidance notes privacy considerations if hosted embedding providers are wired in. <br>
Mitigation: Use local embeddings for sensitive corpora or review provider privacy, consent, and redaction requirements before sending chunks to hosted services. <br>
Risk: The artifact scripts use simplified or mock retrieval and embedding behavior, so benchmark results may not reflect production vector search quality. <br>
Mitigation: Validate recommendations against real embedding models, vector stores, and representative test queries before relying on the output. <br>
## Reference(s): <br>
- [ClawHub skill page](https://clawhub.ai/abhinas90/rag-pipeline-starter) <br>
## Skill Output: <br>
**Output Type(s):** [text, markdown, code, shell commands, configuration, guidance] <br>
**Output Format:** [Markdown guidance with Python and shell command examples, plus JSON outputs from bundled helper scripts.] <br>
**Output Parameters:** [1D] <br>
**Other Properties Related to Output:** [Produces local analysis and recommendations; script results depend on user-provided documents, chunks, queries, and indexes.] <br>
## Skill Version(s): <br>
1.0.1 (source: server release evidence and artifact skill.json) <br>
## Ethical Considerations: <br>
Users should evaluate whether this skill is appropriate for their environment, review any generated or modified files before relying on them, and apply their organization's safety, security, and compliance requirements before deployment. <br>