# Parse PDF

> Upload and process a PDF file using specified parsing and chunking configurations.



## OpenAPI

````yaml post /parse/pdf/
openapi: 3.0.3
info:
  title: Lyzr Knowledge Base - Parse PDF
  version: 1.0.0
  description: >-
    Accepts a PDF file and processes it using a specified parser, applying
    chunking parameters for RAG system preparation.
servers:
  - url: https://rag-prod.studio.lyzr.ai/v3
security: []
paths:
  /parse/pdf/:
    post:
      summary: Parse PDF
      description: >-
        Upload and process a PDF file using specified parsing and chunking
        configurations.
      operationId: parsePdf
      requestBody:
        required: true
        content:
          multipart/form-data:
            schema:
              type: object
              required:
                - file
              properties:
                file:
                  type: string
                  format: binary
                  description: The PDF file to upload.
                data_parser:
                  type: string
                  nullable: true
                  description: The parser engine to use (e.g., 'llmsherpa').
                  example: llmsherpa
                chunk_size:
                  type: integer
                  nullable: true
                  description: Size of the chunks for text splitting.
                  example: 1000
                chunk_overlap:
                  type: integer
                  nullable: true
                  description: Overlap between consecutive text chunks.
                  example: 100
                extra_info:
                  type: string
                  description: >-
                    Optional JSON string of extra metadata to associate with the
                    parsed documents.
                  example: '{}'
      responses:
        '200':
          description: PDF file successfully parsed and documents returned.
          content:
            application/json:
              schema:
                type: string
                description: Placeholder for a list of processed documents.
        '403':
          description: Forbidden - API key missing or invalid.
          content:
            application/json:
              schema:
                type: object
                properties:
                  detail:
                    type: string
                    example: API key missing
        '422':
          description: Validation Error (e.g., missing required fields or invalid input).
          content:
            application/json:
              schema:
                type: object
                properties:
                  detail:
                    type: array
                    items:
                      type: object
                      properties:
                        loc:
                          type: array
                          items:
                            oneOf:
                              - type: string
                              - type: integer
                        msg:
                          type: string
                        type:
                          type: string
      security:
        - ApiKeyAuth: []
components:
  securitySchemes:
    ApiKeyAuth:
      type: apiKey
      in: header
      name: x-api-key

````
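
The spec above describes a `multipart/form-data` POST authenticated with an `x-api-key` header. As a rough illustration, the following standard-library Python sketch shows one way such a request might be assembled and how the documented `403`/`422` error bodies could be summarized. The helper names (`build_multipart`, `build_parse_pdf_request`, `parse_pdf`, `format_error_detail`) are hypothetical and not part of any Lyzr SDK, and because the `200` response schema is only a placeholder, the sketch returns whatever JSON the server sends without assuming its shape.

```python
import json
import os
import urllib.request
import uuid

BASE_URL = "https://rag-prod.studio.lyzr.ai/v3"  # taken from the spec's `servers` entry


def build_multipart(fields, file_name, file_bytes, boundary=None):
    """Encode text fields plus one PDF under the `file` key as multipart/form-data."""
    boundary = boundary or uuid.uuid4().hex
    parts = []
    for name, value in fields.items():
        if value is None:  # leave out optional fields that were not set
            continue
        parts.append(
            f'--{boundary}\r\n'
            f'Content-Disposition: form-data; name="{name}"\r\n\r\n'
            f'{value}\r\n'.encode("utf-8")
        )
    parts.append(
        f'--{boundary}\r\n'
        f'Content-Disposition: form-data; name="file"; filename="{file_name}"\r\n'
        f'Content-Type: application/pdf\r\n\r\n'.encode("utf-8")
        + file_bytes + b"\r\n"
    )
    parts.append(f"--{boundary}--\r\n".encode("utf-8"))
    return b"".join(parts), f"multipart/form-data; boundary={boundary}"


def build_parse_pdf_request(api_key, file_name, file_bytes, data_parser=None,
                            chunk_size=None, chunk_overlap=None, extra_info=None):
    """Build (but do not send) the POST /parse/pdf/ request."""
    fields = {
        "data_parser": data_parser,
        "chunk_size": chunk_size,
        "chunk_overlap": chunk_overlap,
        # The spec documents extra_info as a JSON string, so serialize it here.
        "extra_info": json.dumps(extra_info) if extra_info is not None else None,
    }
    body, content_type = build_multipart(fields, file_name, file_bytes)
    return urllib.request.Request(
        f"{BASE_URL}/parse/pdf/",
        data=body,
        method="POST",
        headers={"x-api-key": api_key, "Content-Type": content_type},
    )


def parse_pdf(path, api_key, **options):
    """Read a local PDF, send it, and return the decoded JSON response as-is."""
    with open(path, "rb") as fh:
        request = build_parse_pdf_request(
            api_key, os.path.basename(path), fh.read(), **options
        )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read().decode("utf-8"))


def format_error_detail(payload):
    """Flatten a 403 (string detail) or 422 (list of loc/msg/type) error body."""
    detail = payload.get("detail", [])
    if isinstance(detail, str):
        return [detail]
    return [
        f"{'.'.join(str(p) for p in item.get('loc', []))}: {item.get('msg', '')}"
        for item in detail
    ]
```

A call might look like `parse_pdf("report.pdf", api_key, data_parser="llmsherpa", chunk_size=1000, chunk_overlap=100, extra_info={})`, where `report.pdf` and `api_key` are placeholders. `urllib.request.urlopen` raises `urllib.error.HTTPError` on `403` and `422`; the error body can be read from the exception, decoded as JSON, and passed to `format_error_detail`.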