Examples of top-used LangChain document loaders including CSVLoader, DirectoryLoader, PyPDFLoader, TextLoader, and WebBaseLoader. These loaders standardize raw data into LangChain Document objects for further processing, splitting, embeddings, and RAG workflows.

tahirkorma/langchain-document-loaders

LangChain Document Loaders

This repository highlights the most commonly used document loaders in LangChain, which are essential for bringing raw data into a standardized Document format. Document loaders serve as the entry point for processing data before applying text splitters, embeddings, and retrieval pipelines.
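To make the "standardized Document format" concrete, here is a minimal standard-library sketch of what a loader produces. `SimpleDocument` and `load_text_file` are hypothetical names used only for illustration; they mirror the `page_content` and `metadata` fields of LangChain's `Document` but are not LangChain code.

```python
from dataclasses import dataclass, field
from pathlib import Path
import tempfile


@dataclass
class SimpleDocument:
    """Hypothetical stand-in mirroring the two fields of a LangChain Document."""
    page_content: str
    metadata: dict = field(default_factory=dict)


def load_text_file(path: str) -> list[SimpleDocument]:
    """Read one text file and wrap it in a single document that records its source."""
    text = Path(path).read_text(encoding="utf-8")
    return [SimpleDocument(page_content=text, metadata={"source": path})]


# Usage: write a small sample file, then load it.
with tempfile.TemporaryDirectory() as tmp:
    sample = Path(tmp) / "note.txt"
    sample.write_text("Loaders turn raw files into documents.", encoding="utf-8")
    docs = load_text_file(str(sample))

print(docs[0].page_content)
print(list(docs[0].metadata))
```

Every loader, whatever its source, returns a list of objects of this shape, which is why splitters and embedding steps can be written once and reused.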


📌 Top Used Document Loaders

  • CSVLoader → For loading structured tabular data from .csv files.
  • DirectoryLoader → For bulk ingestion of multiple files from directories.
  • PyPDFLoader → For parsing and extracting text from PDF documents.
  • TextLoader → For reading plain text (.txt) files.
  • WebBaseLoader → For fetching and parsing web pages directly from URLs.
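The mapping above can be sketched as a small dispatch helper. `pick_loader_name` and `load_documents` are hypothetical helpers, not part of LangChain. The sketch assumes the loaders are importable from `langchain_community.document_loaders` (older releases exposed them under `langchain.document_loaders`) and that each accepts the path or URL as its first argument. Some loaders need extra packages, for example `pypdf` for PyPDFLoader and `beautifulsoup4` for WebBaseLoader.

```python
from pathlib import Path

# Maps a file suffix to the name of a loader class from the list above.
SUFFIX_TO_LOADER = {
    ".csv": "CSVLoader",
    ".pdf": "PyPDFLoader",
    ".txt": "TextLoader",
}


def pick_loader_name(source: str) -> str:
    """Return the name of the loader that fits `source` (hypothetical helper)."""
    if source.startswith(("http://", "https://")):
        return "WebBaseLoader"
    if Path(source).is_dir():
        return "DirectoryLoader"
    suffix = Path(source).suffix.lower()
    if suffix not in SUFFIX_TO_LOADER:
        raise ValueError(f"no loader configured for {source!r}")
    return SUFFIX_TO_LOADER[suffix]


def load_documents(source: str):
    """Instantiate the chosen loader and return the documents it loads."""
    # Imported lazily so the dispatch logic above works without LangChain installed.
    from langchain_community import document_loaders

    loader_cls = getattr(document_loaders, pick_loader_name(source))
    return loader_cls(source).load()
```

Calling `load_documents("report.pdf")` would, under these assumptions, be equivalent to `PyPDFLoader("report.pdf").load()`. In a real project, importing the specific loader class directly is usually clearer than dispatching by name.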

🔎 Notes

  • These are among the most commonly used loaders in practical LLM applications, but LangChain supports many others (e.g., JSONLoader, UnstructuredFileLoader, EverNoteLoader).
  • Choosing the right loader depends on your data source (local files, web content, structured datasets, etc.).
  • Loaders are typically combined with text splitters to prepare data for downstream tasks such as embeddings, question answering, and RAG.
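To illustrate why loaders are paired with text splitters, here is a deliberately naive character-window splitter written with the standard library only. `split_text` is a hypothetical function, not LangChain's implementation. LangChain's own splitters (such as `RecursiveCharacterTextSplitter`) expose similar `chunk_size` and `chunk_overlap` parameters but split on separators rather than at fixed offsets.

```python
def split_text(text: str, chunk_size: int = 40, chunk_overlap: int = 10) -> list[str]:
    """Cut `text` into fixed-size character chunks that overlap by `chunk_overlap`."""
    if chunk_size <= 0:
        raise ValueError("chunk_size must be positive")
    if not 0 <= chunk_overlap < chunk_size:
        raise ValueError("chunk_overlap must be in [0, chunk_size)")

    step = chunk_size - chunk_overlap  # how far the window advances each time
    chunks = []
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
        if start + chunk_size >= len(text):
            break  # the window already reached the end of the text
    return chunks


chunks = split_text("abcdefghij", chunk_size=4, chunk_overlap=1)
print(chunks)  # ['abcd', 'defg', 'ghij']
```

The overlap keeps a little shared context between neighbouring chunks, so a sentence cut at a boundary is still partly visible to the embedding or retrieval step that handles each chunk.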

📚 Resources
