Showing 1882 open source projects for ".pdf"

View related business solutions
  • AI-powered service management for IT and enterprise teams Icon
    AI-powered service management for IT and enterprise teams

    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
    Try it Free
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 1
    GanttProject

    GanttProject

    Official GanttProject repository

    GanttProject is a cross-platform desktop application for project scheduling and Gantt chart creation. It allows users to create tasks, assign resources, and track progress with ease. GanttProject is aimed at small teams and individuals who need offline project planning without the complexity or cost of enterprise tools.
    Downloads: 17 This Week
    Last Update:
    See Project
  • 2
    ripgrep

    ripgrep

    Regex pattern directory search tool that respects your .gitignore

    ...By default, ripgrep will ignore your .gitignore and skip hidden files or directories and binary files automatically. ripgrep has first class support on Windows, macOS and Linux, with binary downloads available for every release. ripgrep is similar to other popular search tools like The Silver Searcher, ack and grep. ripgrep supports arbitrary input preprocessing filters which could be PDF text extraction, less supported decompression, decrypting, automatic encoding detection and so on. In other words, use ripgrep if you like speed, filtering by default, fewer bugs and Unicode support.
    Downloads: 70 This Week
    Last Update:
    See Project
  • 3

    Images To PDF

    ImagesToPDF freeware designed to convert image files into PDF

    Images To PDF is a user-friendly freeware application designed to convert image files into PDF documents quickly and efficiently. Whether you're a professional looking to compile multiple images into a single PDF file or an individual seeking a simple tool to organize and share images, Images To PDF offers a seamless solution. With its intuitive interface and straightforward functionality, Images To PDF streamlines the process of converting various image formats, such as JPG, and PNG, into high-quality PDF files. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 4
    HDoujin Downloader

    HDoujin Downloader

    An easy-to-use manga and dōjinshi downloader supporting 800+ webistes

    HDoujin Downloader is a manga and dōjinshi download manager supporting 800+ websites across many different languages.
    Downloads: 14 This Week
    Last Update:
    See Project
  • Gen AI apps are built with MongoDB Atlas Icon
    Gen AI apps are built with MongoDB Atlas

    Build gen AI apps with an all-in-one modern database: MongoDB Atlas

    MongoDB Atlas provides built-in vector search and a flexible document model so developers can build, scale, and run gen AI apps without stitching together multiple databases. From LLM integration to semantic search, Atlas simplifies your AI architecture—and it’s free to get started.
    Start Free
  • 5
    Percollate

    Percollate

    A command-line tool to turn web pages into beautiful, readable PDF

    Percollate is a command-line tool that turns web pages into beautifully formatted PDF, EPUB, or HTML files. By default, percollate processes URLs in parallel. Use the --wait option to process them sequentially instead, with a pause between items. The delay is specified in seconds, and can be zero. By default, percollate bundles all web pages in a single file. Use the --individual flag to export each source to a separate file.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 6
    Google Open Source Project Style Guide

    Google Open Source Project Style Guide

    Chinese version of Google open source project style guide

    ...If the project you are modifying originates from Google, you may be directed to the English version of the project page to understand the style used by the project. The Chinese version of the project uses reStructuredText plain text markup syntax, and uses Sphinx to generate document formats such as HTML / CHM / PDF.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 7
    PageIndex

    PageIndex

    Document Index for Vectorless, Reasoning-based RAG

    ...This reasoning-driven retrieval aligns more naturally with how humans explore complex texts, improving relevance and traceability, especially in professional domains like financial reports, legal contracts, and technical manuals. The project includes example notebooks, scripts for tree generation and search, and support for multiple document formats including PDF and markdown, with tools designed to preserve context and semantic boundaries.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 8
    Markdown-Resume-Template

    Markdown-Resume-Template

    Resume template designed to be edited in Markdown

    ...The structure avoids flashy but unreadable designs, favoring a hierarchy and layout that ATS systems and human reviewers can parse quickly. You can fork the repo, make it your own, and export to HTML or PDF using standard tooling. It’s especially useful for developers who want a low-friction, repeatable way to maintain multiple resume variants while keeping formatting consistent.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 9
    Luxor

    Luxor

    Simple drawings using vector graphics; Cairo "for tourists!"

    ...Luxor is thoroughly procedural and static: your code issues a sequence of simple graphics ‘commands’ until you’ve completed a drawing, and then the results are saved into a PDF, PNG, SVG, or EPS file.
    Downloads: 2 This Week
    Last Update:
    See Project
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 10
    Crowbook LaTeX

    Crowbook LaTeX

    Converts books written in Markdown to HTML, LaTeX/PDF and EPUB

    Crowbook's aim is to allow you to write a book in Markdown without worrying about formatting or typography and let the program generate HTML, PDF and EPUB output for you. Its focus is novels and fiction, and the default settings should (hopefully) generate readable books with correct typography without requiring you to worry about it.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Shower Presentation Template

    Shower Presentation Template

    Shower HTML presentation engine

    Shower Presentation Template is a shower HTML presentation engine. Built on HTML, CSS and vanilla JavaScript, works in all modern browsers. Themes are separated from engine, and comes with fully keyboard accessible. Printable to PDF and includes Ribbon and Material themes, and core with plugins. You’ll need Node.js installed on your computer. Latest stable versions of Chrome, Edge, Firefox, and Safari are supported.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Easy DataSet

    Easy DataSet

    A powerful tool for creating datasets for LLM fine-tuning

    Easy DataSet is a comprehensive open-source tool designed to make creating high-quality datasets for large language model fine-tuning, retrieval-augmented generation (RAG), and evaluation as easy and automated as possible by providing intuitive interfaces and powerful parsing, segmentation, and labeling tools. It supports ingesting domain-specific documents in a wide range of formats — including PDF, Markdown, DOCX, EPUB, and plain text — and can intelligently segment, clean, and structure content into rich datasets tailored for downstream LLM training needs. The system includes automated question-generation capabilities, hierarchical label trees, and answer generation pipelines that use LLM APIs to produce coherent paired data with customizable templates. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 13
    OCRBase

    OCRBase

    MD/.JSON Document OCR and structured data extraction API

    OCRBase is a self-hostable document OCR and structured extraction system built to turn PDFs into machine-usable outputs at scale, aiming to bridge the gap between raw text extraction and production-ready pipelines. Instead of treating OCR as a one-off script, it presents an API-driven workflow where documents are submitted as jobs and processed through a queue-based architecture that can handle high throughput. The core output is designed for downstream automation, producing structured...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 14
    AnythingLLM

    AnythingLLM

    The all-in-one Desktop & Docker AI application with full RAG and AI

    A full-stack application that enables you to turn any document, resource, or piece of content into a context that any LLM can use as references during chatting. This application allows you to pick and choose which LLM or Vector Database you want to use as well as supporting multi-user management and permissions. AnythingLLM is a full-stack application where you can use commercial off-the-shelf LLMs or popular open-source LLMs and vectorDB solutions to build a private ChatGPT with no...
    Downloads: 144 This Week
    Last Update:
    See Project
  • 15
    OpenSign

    OpenSign

    🔥 The free & Open Source DocuSign alternative

    The premier open source document signing solution (DocuSign alternative). Welcome to OpenSign, the premier open source docusign alternative - document e-signing solution designed to provide a secure, reliable and free alternative to commercial esign platforms like DocuSign, PandaDoc, SignNow, Adobe Sign, Smartwaiver, SignRequest, HelloSign & Zoho sign. Our mission is to democratize the document signing process, making it accessible and straightforward for everyone.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 16
    Ray Tracing in One Weekend Book Series

    Ray Tracing in One Weekend Book Series

    The Ray Tracing in One Weekend series of books

    ...(While that also frees you from the requirement of providing attribution, it would help the overall project if you could point back to this web site as a service to other users.) These books are formatted for printing directly from your browser, where you can also (on most browsers) save them as PDF. In Ray Tracing in One Weekend, you will build a simple brute-force path tracer. Continuing with Ray Tracing: The Next Week, you will add textures, volumes (like fog), rectangles, instances, lights, and support for lots of objects using a bounding volume hierarchy (BVH). Finally, with Ray Tracing: The Rest Of Your Life, we'll dive into the math of creating a very serious ray tracer.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 17
    Documind

    Documind

    Open-source platform for extracting structured data from documents

    Documind is an advanced document processing tool that leverages AI to extract structured data from PDFs. It is built to handle PDF conversions, extract relevant information, and format results as specified by customizable schemas.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 18
    pdf-to-podcast

    pdf-to-podcast

    PDF to Podcast transforms any PDF document into a podcast-ready audio

    PDF to Podcast transforms any PDF document into a podcast-ready audio episode using advanced AI text-to-speech (TTS) providers. Upload a PDF, select your preferred voice and provider, and receive an MP3 and a ready-to-use RSS feed for your podcast app.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19
    PDF Tinkerer

    PDF Tinkerer

    Tinker with PDF files

    Tinker with PDF files. Download the JAR file for your OS (e.g. Windows) and double click on it. You will need at least Java 21 (e.g. https://adoptium.net/temurin/releases/?os=any&arch=any&version=21) to run this Desktop-App. The latest releases of PDF Tinkerer can now be found on: https://gitlab.com/gjwu/pdf-tinkerer/-/releases
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    PDFMathTranslate

    PDFMathTranslate

    PDF scientific paper translation with preserved formats

    PDFMathTranslate is a Python-based tool that uses AI translation to convert academic PDFs into bilingual (e.g. Chinese-English) documents while preserving formatting, including math notation. It supports OCR-enhanced content and offers CLI, GUI, Docker, and Zotero integration under AGPL v3.
    Downloads: 12 This Week
    Last Update:
    See Project
  • 21
    Sphinx

    Sphinx

    Main repository for the Sphinx documentation builder

    ...It was originally created for the Python documentation, and it has excellent facilities for the documentation of software projects in a range of languages. Of course, this site is also created from reStructuredText sources using Sphinx! HTML (including Windows HTML Help), LaTeX (for printable PDF versions), ePub, Texinfo, manual pages, plain text. Semantic markup and automatic links for functions, classes, citations, glossary terms and similar pieces of information. Easy definition of a document tree, with automatic links to siblings, parents and children. General index as well as a language-specific module index. Automatic highlighting using the Pygments highlighter. ...
    Downloads: 28 This Week
    Last Update:
    See Project
  • 22
    DHTMLX Gantt

    DHTMLX Gantt

    GPL version of Javascript Gantt Chart

    DHTMLX Gantt is a full-featured JavaScript library for creating interactive Gantt charts in web applications. It supports drag-and-drop, critical path visualization, and customizable task hierarchies, making it ideal for project planning tools. Its modular architecture and API provide extensive customization for developers.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 23
    Zerox OCR

    Zerox OCR

    PDF to Markdown with vision models

    A dead simple way of OCR-ing a document for AI ingestion. Documents are meant to be a visual representation after all. With weird layouts, tables, charts, etc. The vision models just make sense. ZeroX is an open-source machine learning framework designed for fast experimentation and production deployment, optimized for speed and ease of use.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 24
    Kaneo

    Kaneo

    An open source project management platform

    Kaneo is a lightweight invoicing and client management tool for freelancers and small businesses. Built as a modern, self-hosted solution, Kaneo helps users generate invoices, track payments, and manage clients from a user-friendly dashboard. It's especially useful for independent workers who prefer local control over SaaS options.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 25
    circuitikz

    circuitikz

    CircuiTikZ TeX/LaTeX package for drawing circuits

    ...It was born mainly for writing Massimo Redaelli's exercise book and exam sheets for the Elettrotecnica courses at Politecnico di Milano, Italy. He wanted a tool that was easy to use, with a lean syntax, native to LaTeX, and supporting direct PDF output format. circuitikz is included with the most common LaTeX systems, so it should work out of the box. Anyway, the main dependency is on TikZ/PGF, xstring and siunitx.
    Downloads: 7 This Week
    Last Update:
    See Project