Showing 1704 open source projects for ".pdf"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 1
    DHTMLX Gantt

    DHTMLX Gantt

    GPL version of Javascript Gantt Chart

    DHTMLX Gantt is a full-featured JavaScript library for creating interactive Gantt charts in web applications. It supports drag-and-drop, critical path visualization, and customizable task hierarchies, making it ideal for project planning tools. Its modular architecture and API provide extensive customization for developers.
    Downloads: 18 This Week
    Last Update:
    See Project
  • 2
    Capture Website

    Capture Website

    Capture screenshots of websites

    A command-line tool and API that captures screenshots of websites using headless Chromium, supporting various output formats and customization.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 3
    Distributions.jl

    Distributions.jl

    A Julia package for probability distributions and associated functions

    A Julia package for probability distributions and associated functions.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 4
    GanttProject

    GanttProject

    Official GanttProject repository

    GanttProject is a cross-platform desktop application for project scheduling and Gantt chart creation. It allows users to create tasks, assign resources, and track progress with ease. GanttProject is aimed at small teams and individuals who need offline project planning without the complexity or cost of enterprise tools.
    Downloads: 15 This Week
    Last Update:
    See Project
  • Find Hidden Risks in Windows Task Scheduler Icon
    Find Hidden Risks in Windows Task Scheduler

    Free diagnostic script reveals configuration issues, error patterns, and security risks. Instant HTML report.

    Windows Task Scheduler might be hiding critical failures. Download the free JAMS diagnostic tool to uncover problems before they impact production—get a color-coded risk report with clear remediation steps in minutes.
    Download Free Tool
  • 5
    circuitikz

    circuitikz

    CircuiTikZ TeX/LaTeX package for drawing circuits

    ...It was born mainly for writing Massimo Redaelli's exercise book and exam sheets for the Elettrotecnica courses at Politecnico di Milano, Italy. He wanted a tool that was easy to use, with a lean syntax, native to LaTeX, and supporting direct PDF output format. circuitikz is included with the most common LaTeX systems, so it should work out of the box. Anyway, the main dependency is on TikZ/PGF, xstring and siunitx.
    Downloads: 14 This Week
    Last Update:
    See Project
  • 6
    PDF Split and Merge

    PDF Split and Merge

    Split and merge PDF files on any platform

    Split and merge PDF files with PDFsam, an easy-to-use desktop tool with graphical, command line and web interface.
    Leader badge
    Downloads: 255 This Week
    Last Update:
    See Project
  • 7
    Google Open Source Project Style Guide

    Google Open Source Project Style Guide

    Chinese version of Google open source project style guide

    ...If the project you are modifying originates from Google, you may be directed to the English version of the project page to understand the style used by the project. The Chinese version of the project uses reStructuredText plain text markup syntax, and uses Sphinx to generate document formats such as HTML / CHM / PDF.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 8
    PageIndex

    PageIndex

    Document Index for Vectorless, Reasoning-based RAG

    ...This reasoning-driven retrieval aligns more naturally with how humans explore complex texts, improving relevance and traceability, especially in professional domains like financial reports, legal contracts, and technical manuals. The project includes example notebooks, scripts for tree generation and search, and support for multiple document formats including PDF and markdown, with tools designed to preserve context and semantic boundaries.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 9
    Markdown-Resume-Template

    Markdown-Resume-Template

    Resume template designed to be edited in Markdown

    ...The structure avoids flashy but unreadable designs, favoring a hierarchy and layout that ATS systems and human reviewers can parse quickly. You can fork the repo, make it your own, and export to HTML or PDF using standard tooling. It’s especially useful for developers who want a low-friction, repeatable way to maintain multiple resume variants while keeping formatting consistent.
    Downloads: 2 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 10
    Luxor

    Luxor

    Simple drawings using vector graphics; Cairo "for tourists!"

    ...Luxor is thoroughly procedural and static: your code issues a sequence of simple graphics ‘commands’ until you’ve completed a drawing, and then the results are saved into a PDF, PNG, SVG, or EPS file.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 11
    AnythingLLM

    AnythingLLM

    The all-in-one Desktop & Docker AI application with full RAG and AI

    A full-stack application that enables you to turn any document, resource, or piece of content into a context that any LLM can use as references during chatting. This application allows you to pick and choose which LLM or Vector Database you want to use as well as supporting multi-user management and permissions. AnythingLLM is a full-stack application where you can use commercial off-the-shelf LLMs or popular open-source LLMs and vectorDB solutions to build a private ChatGPT with no...
    Downloads: 151 This Week
    Last Update:
    See Project
  • 12
    PDF Tinkerer

    PDF Tinkerer

    Tinker with PDF files

    Tinker with PDF files. Download the JAR file for your OS (e.g. Windows) and double click on it. You will need at least Java 21 (e.g. https://adoptium.net/temurin/releases/?os=any&arch=any&version=21) to run this Desktop-App. The latest releases of PDF Tinkerer can now be found on: https://gitlab.com/gjwu/pdf-tinkerer/-/releases
    Downloads: 3 This Week
    Last Update:
    See Project
  • 13
    Easy DataSet

    Easy DataSet

    A powerful tool for creating datasets for LLM fine-tuning

    Easy DataSet is a comprehensive open-source tool designed to make creating high-quality datasets for large language model fine-tuning, retrieval-augmented generation (RAG), and evaluation as easy and automated as possible by providing intuitive interfaces and powerful parsing, segmentation, and labeling tools. It supports ingesting domain-specific documents in a wide range of formats — including PDF, Markdown, DOCX, EPUB, and plain text — and can intelligently segment, clean, and structure content into rich datasets tailored for downstream LLM training needs. The system includes automated question-generation capabilities, hierarchical label trees, and answer generation pipelines that use LLM APIs to produce coherent paired data with customizable templates. ...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 14
    OCRBase

    OCRBase

    MD/.JSON Document OCR and structured data extraction API

    OCRBase is a self-hostable document OCR and structured extraction system built to turn PDFs into machine-usable outputs at scale, aiming to bridge the gap between raw text extraction and production-ready pipelines. Instead of treating OCR as a one-off script, it presents an API-driven workflow where documents are submitted as jobs and processed through a queue-based architecture that can handle high throughput. The core output is designed for downstream automation, producing structured...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 15
    HDoujin Downloader

    HDoujin Downloader

    An easy-to-use manga and dōjinshi downloader supporting 800+ webistes

    HDoujin Downloader is a manga and dōjinshi download manager supporting 800+ websites across many different languages.
    Downloads: 12 This Week
    Last Update:
    See Project
  • 16
    Crowbook LaTeX

    Crowbook LaTeX

    Converts books written in Markdown to HTML, LaTeX/PDF and EPUB

    Crowbook's aim is to allow you to write a book in Markdown without worrying about formatting or typography and let the program generate HTML, PDF and EPUB output for you. Its focus is novels and fiction, and the default settings should (hopefully) generate readable books with correct typography without requiring you to worry about it.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    OpenSign

    OpenSign

    🔥 The free & Open Source DocuSign alternative

    The premier open source document signing solution (DocuSign alternative). Welcome to OpenSign, the premier open source docusign alternative - document e-signing solution designed to provide a secure, reliable and free alternative to commercial esign platforms like DocuSign, PandaDoc, SignNow, Adobe Sign, Smartwaiver, SignRequest, HelloSign & Zoho sign. Our mission is to democratize the document signing process, making it accessible and straightforward for everyone.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 18
    Ray Tracing in One Weekend Book Series

    Ray Tracing in One Weekend Book Series

    The Ray Tracing in One Weekend series of books

    ...(While that also frees you from the requirement of providing attribution, it would help the overall project if you could point back to this web site as a service to other users.) These books are formatted for printing directly from your browser, where you can also (on most browsers) save them as PDF. In Ray Tracing in One Weekend, you will build a simple brute-force path tracer. Continuing with Ray Tracing: The Next Week, you will add textures, volumes (like fog), rectangles, instances, lights, and support for lots of objects using a bounding volume hierarchy (BVH). Finally, with Ray Tracing: The Rest Of Your Life, we'll dive into the math of creating a very serious ray tracer.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 19
    Documind

    Documind

    Open-source platform for extracting structured data from documents

    Documind is an advanced document processing tool that leverages AI to extract structured data from PDFs. It is built to handle PDF conversions, extract relevant information, and format results as specified by customizable schemas.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 20
    Percollate

    Percollate

    A command-line tool to turn web pages into beautiful, readable PDF

    Percollate is a command-line tool that turns web pages into beautifully formatted PDF, EPUB, or HTML files. By default, percollate processes URLs in parallel. Use the --wait option to process them sequentially instead, with a pause between items. The delay is specified in seconds, and can be zero. By default, percollate bundles all web pages in a single file. Use the --individual flag to export each source to a separate file.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 21
    OmniTools

    OmniTools

    Self-hosted collection of powerful web-based tools for everyday tasks

    ...A key design choice is that file processing happens entirely on the client side, meaning your data stays in your browser instead of being sent to the backend. The tool catalog spans both technical and non-technical needs, including image, video, audio, PDF, text, date/time, math, and data format utilities like JSON/CSV/XML helpers. It’s also packaged for straightforward self-hosting, with a lightweight Docker image and simple run commands, so it can be deployed quickly on a homelab or internal network.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 22
    Remarkable for Linux

    Remarkable for Linux

    The Markdown Editor for Linux

    ...Remarkable has Github Flavoured Markdown. This has a simple, easy-to-learn syntax with features like checklists, highlighting, links, images and more. Remarkable allows you to export your files to PDF and HTML from within the app. The HTML code is even prettified and PDFs have a TOC. You can style your markdown documents however you like. If you don't like the default styles you can use your own. The code you write is highlighted in the Live Preview. This makes Remarkable great for writing software documentation or even taking lecture notes. ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 23
    Zerox OCR

    Zerox OCR

    PDF to Markdown with vision models

    A dead simple way of OCR-ing a document for AI ingestion. Documents are meant to be a visual representation after all. With weird layouts, tables, charts, etc. The vision models just make sense. ZeroX is an open-source machine learning framework designed for fast experimentation and production deployment, optimized for speed and ease of use.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 24
    PDFMathTranslate

    PDFMathTranslate

    PDF scientific paper translation with preserved formats

    PDFMathTranslate is a Python-based tool that uses AI translation to convert academic PDFs into bilingual (e.g. Chinese-English) documents while preserving formatting, including math notation. It supports OCR-enhanced content and offers CLI, GUI, Docker, and Zotero integration under AGPL v3.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 25
    Kaneo

    Kaneo

    An open source project management platform

    Kaneo is a lightweight invoicing and client management tool for freelancers and small businesses. Built as a modern, self-hosted solution, Kaneo helps users generate invoices, track payments, and manage clients from a user-friendly dashboard. It's especially useful for independent workers who prefer local control over SaaS options.
    Downloads: 4 This Week
    Last Update:
    See Project