Showing 112 open source projects for ".pdf"

View related business solutions
  • Gen AI apps are built with MongoDB Atlas Icon
    Gen AI apps are built with MongoDB Atlas

    The database for AI-powered applications.

    MongoDB Atlas is the developer-friendly database used to build, scale, and run gen AI and LLM-powered apps—without needing a separate vector database. Atlas offers built-in vector search, global availability across 115+ regions, and flexible document modeling. Start building AI apps faster, all in one place.
    Start Free
  • AI-powered service management for IT and enterprise teams Icon
    AI-powered service management for IT and enterprise teams

    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
    Try it Free
  • 1
    Common Resource Grep - crgrep

    Common Resource Grep - crgrep

    Common Resource Grep

    CRGREP searches for matching text in databases, various document formats, archives and other difficult to access resources. A command line tool for name and content text matching in database tables, plain files, MS Office documents, PDF, archives, MP3 audio, image meta-data, scanned documents, maven dependencies and web resources. CRGREP will search resources within resources of any arbitrary combination or depth, so text within a document within a zip archive, and so on. Here you will find binary downloads and discussion (https://sourceforge.net/p/crgrep/discussion/) . ...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2
    pdf-extractor

    pdf-extractor

    Node.js module for rendering pdf pages to images, svgs and HTML files

    Pdf-extractor is a wrapper around pdf.js to generate images, svgs, html files, text files and json files from a pdf on node.js. A DOM Canvas is used to render and export the graphical layer of the pdf. Canvas exports *.png as a default but can be extended to export to other file types like .jpg. Pdf objects are converted to svg using the SVGGraphics parser of pdf.js.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 3
    Super-PDF-Editor

    Super-PDF-Editor

    World's most comprehensive, powerful, process-based PDF editor

    World's most comprehensive, powerful, process-based and lighting fast PDF reader, editor and batch processor. PDF editing with 60+ features rich tools and function like OCR pdf and images and produce output like searchable PDF, Text, Hocr, Box, Unlv. Also, improve image enhancement before OCR operation for better OCR performance. pdf Imposition, etc. Super PDF Editor is best for bulk pdf processing, especially for the printing industry. ...
    Downloads: 18 This Week
    Last Update:
    See Project
  • 4
    Super-PDF-Editor-Lite

    Super-PDF-Editor-Lite

    World's most comprehensive, powerful, process-based PDF editor

    ...Super PDF Editor is best for bulk pdf processing, especially for the printing industry. Easy pdf imposition, booklet, n ups pages, and more. OCR performs in pdf files, scanned pdf files and any pdf files. OCR performs in image files, and supports multiple image formats. Auto and manual image enhancement for better OCR accuracy and quality.
    Downloads: 10 This Week
    Last Update:
    See Project
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 5
    abstract2paper

    abstract2paper

    Auto-generate an entire paper from a prompt or abstract using NLP

    ...Click the "doohicky" link above to get started, and then click the link to open the demo notebook in Google Colaboratory. To run the demo as a Jupyter notebook (e.g., locally), use this version instead. Note: to compile a PDF of your auto-generated paper (when you run the demo locally), you'll need to have a working LaTeX installation on your machine (e.g., so that pdflatex is a recognized system command). The notebook will also automatically install the transformers library if it's not already available in your local environment. In its unmodified state, the demo notebooks use the abstract from the GPT-3 paper as the "seed" for a new paper. ...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 6
    Dickinson

    Dickinson

    Text generation language

    Dickinson is a text-generation language. You can try out the language on the web without installing anything. Binaries for some platforms are available on the releases page. There is an install script that will try to download the right release for your computer.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    OpenKM Document Management - DMS

    OpenKM Document Management - DMS

    Document Management System and Content Management System

    ...Thanks to its elegant and intuitive interface, OpenKM transforms complex operations into easy tasks. The most relevant functions of OpenKM is the indexing of the most common types of files: text, Office, Office 2007, OpenOffice, PDF, HTML, XML, MP3, JPEG, etc. For a complete feature list take a look at http://goo.gl/au8cQy
    Leader badge
    Downloads: 643 This Week
    Last Update:
    See Project
  • 8
    MANTI

    MANTI

    MANTI - Mastering Advanced N-Termini Interpretation

    ...A graphical interface is available as Yoğurtlu_MANTI (a Perl/Tk script) + execut. application versions for Win1x without the need to have Perl installed locally. For a very detailed explanation of script parameters and the evaluation strategy, please consult the extensive manual PDF
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    gImageReader

    gImageReader

    A graphical frontend to tesseract-ocr

    gImageReader is a simple Gtk/Qt front-end to tesseract. Features include: - Import PDF documents and images from disk, scanning devices, clipboard and screenshots - Process multiple images and documents in one go - Manual or automatic recognition area definition - Recognize to plain text or to hOCR documents - Recognized text displayed directly next to the image - Post-process the recognized text, including spellchecking - Generate PDF documents from hOCR documents **Note**: This page is only a mirror for the downloads. ...
    Leader badge
    Downloads: 152 This Week
    Last Update:
    See Project
  • Find Hidden Risks in Windows Task Scheduler Icon
    Find Hidden Risks in Windows Task Scheduler

    Free diagnostic script reveals configuration issues, error patterns, and security risks. Instant HTML report.

    Windows Task Scheduler might be hiding critical failures. Download the free JAMS diagnostic tool to uncover problems before they impact production—get a color-coded risk report with clear remediation steps in minutes.
    Download Free Tool
  • 10
    Next Generation Programming

    Next Generation Programming

    Compose Software Without Writing Any Programing Code

    "Next Generation Programming - Programming Without Coding Software" is a drag-drop wizard for creating simple or complex applications without writing any programming language code The Software is coded/designed with "Java Programming Language" for novice/expert programmers; Programmers can write softwares with visual tools : drag-drop components;visual editors... Programmers can use the software to compose of simple/complex applications : Database programs, circuit design, generate...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    Paperless-ng

    Paperless-ng

    A supercharged version of paperless, scan, index and archive docs

    Paperless is a simple Django application running in two parts, a Consumer (the thing that does the indexing) and a Web server (the part that lets you search & download already-indexed documents). Paper is a nightmare. Environmental issues aside, there’s no excuse for it in the 21st century. It takes up space, collects dust, doesn’t support any form of a search feature, indexing is tedious, it’s heavy and prone to damage & loss. I wrote this to make “going paperless” easier. I do not have to...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    transferlearning-tutorial

    transferlearning-tutorial

    Tutorial on applied transfer learning

    A bilingual (English/Chinese) concise tutorial on applied transfer learning, hosted by Jindong Wang. It includes LaTeX source for a compact handbook covering theory, algorithms, surveys, and code in MATLAB and Python.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    Parsr

    Parsr

    Transforms PDF, Documents and Images into Enriched Structured Data

    Parsr is an open-source document parsing tool that converts PDFs, scanned images, and other structured documents into structured, machine-readable data formats.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 14
    ANDTool

    ANDTool

    Analysis Nuclei DAB (AND) Tool

    ...ANDTool is written in MATLAB (The MathWorks, Inc., Massachusetts, USA) and the source code and standalone versions are here available for download. USER MANUAL: see the specific PDF available in the Files section. REQUIREMENTS: MATLAB R2017b and Image Processing Toolbox 10.1 or later versions. MAIN CONTACT: Filippo Piccinini (E-mail: filippo.piccinini85@gmail.com)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    MANTI.pl / muda.pl

    MANTI.pl / muda.pl

    muda.pl - MQ unified data assembler

    ...Maybe also useful for normal proteomics purposes but this script is heavily optimized for protein neo-termini identification and validation. For a more thorough explanation of script parameters and evaluation strategy, please consult the extensive manual PDF.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Linux-Intelligent-Ocr-Solution

    Linux-Intelligent-Ocr-Solution

    Easy-OCR solution and Tesseract trainer for GNU/Linux

    Linux-intelligent-ocr-solution Lios is a free and open source software for converting print in to text using either scanner or a camera, It can also produce text out of scanned images from other sources such as Pdf, Image, Folder containing Images or screenshot. Program is given total accessibility for visually impaired. A Tesseract Trainer GUI is also shipped with this package. Forum : https://groups.google.com/forum/#!forum/lios Video Tutorial : https://www.youtube.com/playlist?list=PLn29o8rxtRe1zS1r2-yGm1DNMOZCgdU0i Tesseract Training Tutorial (beta) : https://www.youtube.com/watch?...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 17

    neocr

    Provides OCR solutions for Nepali, based on Tesseract 4.0.

    NeOCR is a free software based on Tesseract (Open Source OCR Engine) for the Windows operating system. It provides an easy and user-friendly user interface to recognize texts contained in images as well as PDF documents and convert to editable text formats (.txt, .doc, .docx). This product is accessible to Blind and Visually Impaired peoples (tested with NVDA and Narrator).
    Downloads: 14 This Week
    Last Update:
    See Project
  • 18

    Merge PDF Files

    It is a Windows library that merges standard PDFs into a final PDF

    ...There are lots of SDKs on the market creating (merging) PDFs (almost all of them have limitations). Our Windows library (MergePDFByNMI.dll) only merges standard PDF files (there are several PDF formats). You can send the input PDFs (by file name or by byte array) and you can have the final PDF (saved on a file or get back on a byte array). The library calls can be synchronous or asynchronous. We want to give you a benchmark, the library was used to create a PDF from single page(scanned) image by an OCR SDK (it is not included in our library, you can use any on the market): 20,000 Images (the OCR SDK creates single page PDF text searchable, running 50 threads) in 80 minutes. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    PyResParser

    PyResParser

    A simple resume parser used for extracting information from resumes

    PyResParser is a simple resume parser that extracts information from resumes, aiding in the automation of resume-processing tasks.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 20

    PDF2EpubMaker

    Convert PDF to epub by OCR

    Qt Application to convert PDF in EPub format with several step : - convert PDF to png with libpoppler - convert pnf to txt by libtesseract - suppress hyphenate - spell checkinng
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Computer Science Books

    Computer Science Books

    Computer Science Books Computer Technology Books PDF

    The books in this warehouse come from the Internet, and the copyright belongs to the original author. It is not for profit, but only for learning and use. If there is any infringement, please contact us.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    MIT Deep Learning Book

    MIT Deep Learning Book

    MIT Deep Learning Book in PDF format by Ian Goodfellow

    The Deep Learning textbook is a resource intended to help students and practitioners enter the field of machine learning in general and deep learning in particular. The online version of the book is now complete and will remain available online for free. MIT Deep Learning Book in PDF format (complete and parts) by Ian Goodfellow, Yoshua Bengio and Aaron Courville. An MIT Press book Ian Goodfellow and Yoshua Bengio and Aaron Courville. Written by three experts in the field, Deep Learning is the only comprehensive book on the subject. This is not available as PDF download. So, I have taken the prints of the HTML content and bound them into a flawless PDF version of the book, as suggested by the website itself. ...
    Downloads: 21 This Week
    Last Update:
    See Project
  • 23
    NASH OS

    NASH OS

    Nash Operating System for Modern Ecommerce

    The all-built-in-one, automatic, ready-to-go out-of-box, easy-to-use state-of-the-art, and really awesome NASH OS! Over 25,000+ flexible features and controls and all scalable!! The most powerful solution ever built to instantly deliver new heights of online ecommerce enterprise to you.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24

    OCR Template Creator

    Create template for images or PDF files to be OCR'ed and databased

    An OCR application which allows for tag/value templates to be created with a Web GUI for handling the automatic processing of PDF's or images of documents/receipts/contracts etc. Interface to Tesseract
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    mbFXWords

    mbFXWords

    Analyze text. Diagonal read subject, predicate, obj. Search other pdf.

    ... - Divide plain text: subject, predicate, object. - Count words: stemming. - Search for similar content: pdf's. Gives out subject, predicate and object of sentences of pdf and plain text files. Provides comfortable GUI. Automatic language detection.
    Downloads: 0 This Week
    Last Update:
    See Project