Browse free open source Text Processing software and projects below. Use the toggles on the left to filter open source Text Processing software by OS, license, language, programming language, and project status.

  • 1

    ConcatPDF

    PDF Concatenation Tool

    ConcatPDF is a tool for concatenating PDF files. It can concatenate, extract, encrypt, decrypt, and configure PDF files, and convert image files to PDF. Both GUI and CUI versions are available. iText.NET is a port of iText to the .NET Framework using J#. This library lets you generate PDF, (X)HTML, XML, and RTF files on the Microsoft .NET Framework, including ASP.NET.
    Downloads: 67 This Week
  • 2
    Text Encoding Initiative

    TEI produces the TEI Guidelines and associated software

    The TEI is an international and interdisciplinary standard used by libraries, museums, publishers, and academics to represent all kinds of literary and linguistic texts, using an encoding scheme that is maximally expressive and minimally obsolescent.
    Downloads: 5 This Week
  • 3
    Diff-ext is an extension for file managers such as Windows Explorer and Nautilus that lets you launch diff/merge tools on selected files.
    Downloads: 5 This Week
  • 4
    GATE
    Note that the source code and issue tracker have now moved to GitHub (https://github.com/GateNLP/). GATE (General Architecture for Text Engineering) is an architecture, framework, and development environment for developing, evaluating, and embedding Human Language Technology. See http://gate.ac.uk for full details.
    Downloads: 2 This Week
  • 5

    syn Text Editor

    Project is not active any more. Sorry!

    Syn is a highly customizable text and programming editor. Syn features syntax highlighting for many languages, active scripting, macro recording, the ability to launch a program (e.g. a compiler) and capture its output, support for projects, etc.
    Downloads: 5 This Week
  • 6
    Refal.NET - Versatile, Compact yet Powerful Text Transformer and Compiler-Writing System. Based on the Refal.NET Virtual Machine (+ Refal.NET Compiler), this RAD tool can be used for rapid prototyping, reducing development effort by up to 10 times.
    Downloads: 3 This Week
  • 7
    DocWire SDK

    Award-winning modern data processing SDK in C++20

    DocWire SDK, a standout C++20 AI-driven data processing tool, has received an award from SourceForge and strong backing from Microsoft. It handles nearly 100 file types, empowering efficient text extraction, web data extraction, and document analysis. For businesses, the shift to DocWire SDK signifies a leap forward. It promises comprehensive document format support and the ability to extract valuable insights from email boxes, databases, and websites using cutting-edge AI. DocWire SDK aims to expand its capabilities, focusing on versatile data extraction, platform support, and seamless integration with various systems. DocWire SDK is dedicated to streamlining data processing, reducing development time and costs, and harnessing the potential of AI. Its advancements promise a superior experience compared to its predecessor, DocToText.
    Downloads: 1 This Week
  • 8
    A well-known and widely used MUI custom class that provides application programmers with a textedit gadget. It supports features like word wrapping, soft styles (bold, italic, underline), a spell-checking interface, and an ARexx interface for scripting.
    Downloads: 0 This Week
  • 9
    NunniMJAX is a minimal Java library for parsing XML. Its API and operation resemble SAX: parsing is sequential and event-driven. The parser strives to verify that the XML is well-formed but performs no validation. NunniMJAX's FSM was generated using NunniFSMGen.
    Downloads: 0 This Week
  • 10
    Requirement Heap is a web-based requirement management and business analysis application. It lets you enter requirements in rich text and supports versioning and the management of requirements. It also handles use cases, interviews, and test cases. It supports multiple projects. Stakeholders and glossaries can be handled per project or globally.
    Downloads: 0 This Week
  • 11
    Tools to update all pages of a website with a single command. The header and menu can be copied to each page. It is also possible to add a header (a licence, for example) to all source files of a project.
    Downloads: 0 This Week
  • 12
    Texinfo Web Publisher

    Multi-format web publishing system based on Texinfo

    Texinfo Web Publisher is a Makefile-based publishing system featuring simultaneous content creation in HTML, non-split HTML, Framed HTML, HTML Zip, XML, DocBook, PDF, DjVu, PostScript, DVI, Plain text, Info, and EPUB book formats. All Texinfo Web Publisher output formats are generated from a single source. Texinfo Web Publisher can be used for website creation, has FTP deployment capabilities, and supports Cascading Style Sheets (CSS). Texinfo Web Publisher is a low-maintenance solution for publishing websites with matching content in multiple formats. Texinfo Web Publisher is a wrapper for GNU Texinfo.
    Downloads: 0 This Week
  • 13
    led-text-editor

    A simple and easy-to-use yet powerful line-oriented text editor.

    led is a simple and easy-to-use yet powerful line-oriented text editor. It is written in Urn Lisp and compiled to Lua, so it is available for every platform where Lua (version 5.1 or higher) is available; however, some special features are available only with Lua 5.1 (or LuaJIT) on AmigaOS, MorphOS, AROS, and UNIX with XTerm. The latest release (18-Mar-2021) now also supports scripts.
    Downloads: 0 This Week
  • 14
    Moved to http://code.google.com/p/evil-programmers/
    Downloads: 0 This Week