Writer PDF Parser
This notebook provides a quick overview for getting started with the Writer PDFParser
document loader.
Writer's PDF Parser converts PDF documents into other formats like text or Markdown. This is particularly useful when you need to extract and process text content from PDF files for further analysis or integration into your workflow. In langchain-writer
, we provide usage of Writer's PDF Parser as a LangChain document parser.
Overviewโ
Integration detailsโ
Class | Package | Local | Serializable | JS support | Package downloads | Package latest |
---|---|---|---|---|---|---|
PDFParser | langchain-writer | โ | โ | โ |
Setupโ
The PDFParser
is available in the langchain-writer
package:
%pip install --quiet -U langchain-writer
Credentialsโ
Sign up for Writer AI Studio to generate an API key (you can follow this Quickstart). Then, set the WRITER_API_KEY environment variable:
import getpass
import os
if not os.getenv("WRITER_API_KEY"):
os.environ["WRITER_API_KEY"] = getpass.getpass("Enter your Writer API key: ")
It's also helpful (but not needed) to set up LangSmith for best-in-class observability. If you wish to do so, you can set the LANGSMITH_TRACING
and LANGSMITH_API_KEY
environment variables:
# os.environ["LANGSMITH_TRACING"] = "true"
# os.environ["LANGSMITH_API_KEY"] = getpass.getpass()