Easy Spider is a distributed Perl Web Crawler Project from 2006
It features code for crawling webpages, distributing the results to a server, and generating XML files from them. The client side can be any computer (Windows or Linux), and the server stores all data.
Websites that use EasySpider Crawling for Article Writing...
An open source search engine with RESTful API and crawlers
...Using the web user interface, the crawlers (web, file, database, etc.), and the client libraries (REST API, Ruby, Rails, Node.js, PHP, Perl), you can quickly and easily integrate advanced full-text search capabilities into your application: full-text search with basic semantics, join queries, boolean queries, facets and filters, document (PDF, Office, etc.) indexing, web scraping, etc. OpenSearchServer runs on Windows and Linux/Unix/BSD.
Framework for search and display of heterogeneous document collections.
...Please visit https://github.com/cdlib/xtf for the latest updates.
Obsolete Description: The eXtensible Text Framework (XTF) is an architecture that supports searching across collections of heterogeneous textual data (XML, PDF, HTML, text, and more), and the presentation of results and documents in a highly configurable manner. Includes highly customized versions of the proven open-source components Lucene and Saxon.
Regain is a Java search engine based on Jakarta Lucene. It indexes and searches files in many formats (HTML, XML, doc(x), xls(x), ppt(x), oo, PDF, RTF, mp3, mp4, Java). A tag library makes it easy to integrate search results into your JSP-based web page.
IDRA (InDexing and Retrieving Automatically) is a tool for indexing a wide range of text files (TXT, DOC, PDF) and image annotation files (XML), query-based searching, visualizing an index, saving it for reuse, evaluation, etc.
JSSQL is a database engine developed entirely in JavaScript. It is a tool for developers to interpret and execute SQL statements on an offline database. A conversion class written in PHP, for use with relational databases (e.g. PostgreSQL, MySQL), generates a series of data files; the JavaScript database engine (JSSQL) then accesses the data through SQL queries and returns a set of records, like any other database engine. This is useful for querying offline databases...
DocInfoRetriever is a web-based full-text document search engine based on Lucene. It lets you search the contents and metadata of documents. Supported formats include doc, xls, pdf, odt, jpg, etc., as well as torrent files.
Search engine for doc, docx, ppt, pptx, and pdf files (open source).
Created by: X-Cisadane (Dwi). Greetz to: XCode, Dunia Santai, Depok Cyber, Borneo Crew, Muslim Hackers, Hacker Cisadane, UG-HotZone 567.
Booletin is a search engine for official gazettes (BOE, BOCM, etc.) that includes an email alert system. It uses Apache Lucene to index the PDF content of Spain's official gazettes.
A utility to extract meta-information (properties/comments) out of various file-types; e.g. HTML, PDF, RTF & various Office documents; OGG/MP3 files and JPEG/PNG/GIF images, which can be presented in various output formats (HTML, XML, LaTeX & plain t
PDFBox is a Java PDF Library. This project will allow access to all of the components in a PDF document. More PDF manipulation features will be added as the project matures. This ships with a utility to take a PDF document and output a text file.
Web file manager, written in object-oriented PHP, with full-text retrieval capabilities
(just for PDF files at the moment...).
Interface similar to explorer/konqueror, with tree structure on the left side.
mod_mysql_auth integrated to grant user control, and OWASP ph
YetAnotherMediaLibrary is a database application that lets users index their personal documents and media, such as images, books, PDFs, CDs, and DVDs.
The library is accessed through a themeable web interface. The Database backend is MySQL.
A web-based search interface tailored to the New Zealand Gazette PDF archive for the NZ library community. A generic Python-based Swish-e search interface.
Spencer is a Java-based, web-hosted filesystem indexing application. It indexes files on network shares, reads inside MSOffice, Open/StarOffice, PDF and zip files and provides a web interface to the index with search functions to find the file you want.
This project implements a complete enterprise solution based on Lucene. It is a smart engine built to index numerous file formats (pdf, ps, xls, doc, ppt, ). The engine can index file systems (filtering), databases, mailing folders, web sites and
Satellite is a Perl website index/search package meant for indexing and searching medium-sized websites. Satellite currently supports text (.txt, .html, etc.) and PDF files.
Go here for a demo: http://satellite2.sourceforge.net
JSSindex (The JavaScript Search Engine) provides full-text search for collections of documents in HTML, PS, PDF, and DjVu. The index and query engine are entirely contained in JavaScript/HTML files. Therefore, searching merely requires a Web browser.
E-Xoops Digger is an advanced search engine for Xoops and E-Xoops. Features include content indexing (e.g. pdf, doc, xls), results displayed by rank, limiting searches to a certain module, results with highlighted text snippets, fuzzy search, exact search....
Kassandra is an SQL-based Latent Semantic Indexing and search engine written mostly in PHP. Supported formats will be at least HTML, Postscript and PDF.
100% Java multithreaded search engine. The client and server communicate over TCP/IP. To index objects, it fetches documents over HTTP and parses HTML, PDF, XML, and plain-text files. Artlight use
A SOAP-based document/file-sharing solution written in Java. It includes a basic web interface, but other clients are possible. You can share and download all common office document formats, such as MS Word, Excel, OpenOffice, and PDF.