Showing 22 open source projects for ".pdf"

View related business solutions
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • Find Hidden Risks in Windows Task Scheduler Icon
    Find Hidden Risks in Windows Task Scheduler

    Free diagnostic script reveals configuration issues, error patterns, and security risks. Instant HTML report.

    Windows Task Scheduler might be hiding critical failures. Download the free JAMS diagnostic tool to uncover problems before they impact production—get a color-coded risk report with clear remediation steps in minutes.
    Download Free Tool
  • 1
    Easyspider - Distributed Web Crawler

    Easyspider - Distributed Web Crawler

    Easy Spider is a distributed Perl Web Crawler Project from 2006

    Easy Spider is a distributed Perl Web Crawler Project from 2006. It features code from crawling webpages, distributing it to a server and generating xml files from it. The client site can be any computer (Windows or Linux) and the Server stores all data. Websites that use EasySpider Crawling for Article Writing...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    OpenSearchServer Search Engine

    OpenSearchServer Search Engine

    An open source search engine with RESTFul API and crawlers

    ...Using the web user interface, the crawlers (web, file, database, etc.) and the client libraries (REST/API , Ruby, Rails, Node.js, PHP, Perl) you will be able to integrate quickly and easily advanced full-text search capabilities in your application: Full-text with basic semantic, join queries, boolean queries, facet and filter, document (PDF, Office, etc.) indexation, web scrapping,etc. OpenSearchServer runs on Windows and Linux/Unix/BSD.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3

    eXtensible Text Framework (XTF)

    Framework for search and display of heterogenous document collections.

    ...Please visit https://github.com/cdlib/xtf for the latest updates. Obsolete Description: The eXtensible Text Framework (XTF) is an architecture that supports searching across collections of heterogeneous textual data (XML, PDF, HTML, text, and more), and the presentation of results and documents in a highly configurable manner. Includes highly customized versions of the proven open-source components Lucene and Saxon.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 4
    Regain is a Java search engine based on Jakarta Lucene. It provides indexing and searching files for plenty of formats (HTML,XML,doc(x),xls(x),ppt(x),oo,PDF,RTF,mp3,mp4,Java). A TagLibrary eases integrating search results in your JSP based web page.
    Downloads: 16 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5
    IDRA (InDexing and Retrieving Automatically) is a tool which allows indexing a wide range of text (TXT, DOC, PDF) and image annotations files (XML), query-based searching, visualizing an index, saving it for re-usability, evaluation, etc.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    JavaScript SQL (JSSQL)

    JavaScript SQL (JSSQL)

    A database engine entirely in JavaScript (AJAX)

    JSSQL is a database engine developed entirely with JavaScript. It is a tool for developers to interpret and execute SQL statements on a offline database. It has a conversion class developed in PHP for use with relational databases (ex. PostgreSQL, MySQL, etc..) that generates a series of data files, then the database engine in JavaScript (JSSQL) accesses the data through SQL querys and return a set of records, similar to any database engine. This is useful for querying offline databases...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    DocInfoRetriever is a Web_based document full-text search engine based on lucene. It allows you to search the contents and metadata of documents . Supported document formats, likes doc, xls, pdf, odt, jpg...etc.,and torrent files.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Google Documents Finder
    Mesin pencari berkas doc, docx, ppt, pptx dan pdf (Open Source). Created by : X-Cisadane (Dwi). Greetz to : XCode, Dunia Santai, Depok Cyber, Borneo Crew, Muslim Hackers, Hacker Cisadane, UG-HotZone 567.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    Booletin es un buscador de Boletines oficiales (BOE, BOCM, etc.), que incluye un sistema de alertas por correo electrónico. Utiliza Apache Lucene para indexar el contenido en pdf de los boletines oficiales de España.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • 10
    A utility to extract meta-information (properties/comments) out of various file-types; e.g. HTML, PDF, RTF & various Office documents; OGG/MP3 files and JPEG/PNG/GIF images, which can be presented in various output formats (HTML, XML, LaTeX & plain t
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    PDFBox is a Java PDF Library. This project will allow access to all of the components in a PDF document. More PDF manipulation features will be added as the project matures. This ships with a utility to take a PDF document and output a text file.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 12
    Webfilemanger, written in OO-Php, with fulltext retrieval capabilities (just for PDF files at the moment...). Interface similar to explorer/konqueror, with tree structure on the left side. mod_mysql_auth integrated to grant user control, and OWASP ph
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    YetAnotherMediaLibrary is a database application, that allows users to index their personal documents and media like images, books, pdf, cds, dvds etc. The library is accessed through a themeable web interface. The Database backend is MySQL.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    A web-based search interface tailored to the New Zealand Gazette PDF archive for the NZ library community. A generic Python-based Swish-e search interface.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Spencer is a Java-based, web-hosted filesystem indexing application. It indexes files on network shares, reads inside MSOffice, Open/StarOffice, PDF and zip files and provides a web interface to the index with search functions to find the file you want.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    This projects implements a complete entreprise solution based on lucene. It's a smart engine implemented to index numerous files formats (pdf, ps, xls, doc, ppt, ). The engine can index file systems (filtering), databases, mailing folders, web sites and
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Satellite is a Perl website index/search package meant for indexing and searching medium size websites. Satellite currently supports text (.txt, .html etc) and pdf files. <br><br><a href=http://satellite2.sourceforge.net>Go here for a demo</a>
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18
    JSSindex (The JavaScript Search Engine) provides full-text search for collections of documents in HTML, PS, PDF, and DjVu. The index and query engine are entirely contained in JavaScript/HTML files. Therefore, searching merely requires a Web browser.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    E-Xoops Digger is an advanced search engine for Xoops and E-Xoops. Features are content indexing (like pdf, doc, xls), display results by rank, limit to a certain module, results with text spnippet highlighted, fuzzy search, exact search....
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Kassandra is an SQL-based Latent Semantic Indexing and search engine written mostly in PHP. Supported formats will be at least HTML, Postscript and PDF.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    100% Java multithread search engine. Communication between the client and server is transferred through TCP-IP. To index objects, it obtains the documents through HTTP protocol and parses HTML files, PDF files, XML files and Text Plain files. Artlight use
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    A SOAP-based Document/File-Sharing solution written in Java. It includes a basic web-interface but other clients are possible. You can share and download all common office document formats like MS Word, Excel, OpenOffice and PDF.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • Next