Open Source Python Text Processing Software

Browse free open source Python Text Processing Software and projects below. Use the toggles on the left to filter open source Python Text Processing Software by OS, license, language, programming language, and project status.

  • 1
    Utilities for general- and special-purpose documentation. Includes reStructuredText, the easy to read, easy to use, what-you-see-is-what-you-get plaintext markup language.
    Downloads: 174 This Week
  • 2
    Diffuse
    Diffuse is a graphical tool for comparing and merging text files. It can retrieve files for comparison from Bazaar, CVS, Darcs, Git, Mercurial, Monotone, RCS, Subversion, and SVK repositories.
    Downloads: 199 This Week
  • 3
    meld-installer

    Meld Installer for Windows

    Bundles Portable Python (with PyGTK) and Meld together in an easy-to-use installer, so you do not have to set up Python or PyGTK yourself and can keep Meld's Python separate from other Python installations on your machine. ** NOTE ** Meld 3.11 and later have official installers, so this project is no longer supported. You can download the new installer here: https://download.gnome.org/binaries/win32/meld/. You should uninstall the old 1.8 version before upgrading.
    Downloads: 30 This Week
  • 4
    DrPython is a highly customizable cross-platform IDE to aid programming in Python. It was developed with teaching in mind and has a clean, simple interface. It is written in Python, using wxPython for the GUI.
    Downloads: 5 This Week
  • 5
    Mamba is an extensible XML template preprocessor written in Python. Using it, you can rapidly develop powerful applications ready to integrate with the internet. It can run as a generic CGI program or be used to generate content.
    Downloads: 6 This Week
  • 6
    DocLite is a simple documentation authoring system. It produces multi-page HTML output (like this document) in a style similar to that found in the Linux HOWTOs or other DocBook-created documents.
    Downloads: 5 This Week
  • 7
    SE|PY is an ActionScript editor written in Python with wxPython, using Scintilla for text highlighting and code collapsing. Features include a snippets panel, a functions panel, and much more. It also contains Flush
    Downloads: 3 This Week
  • 8
    Buzz is a fast graphical editor for XML files with special support for OPML. Using the OPML convergence tools, it can edit just about any outline and many forms of indented text, including Python. In fact, Buzz was written with Buzz! It is written in P
    Downloads: 2 This Week
  • 9
    The converter automatically performs the full process of converting the files of a C project into the equivalent C++ files. Classes are created, variables and functions become attributes and methods, and the changes are propagated to all files.
    Downloads: 1 This Week
  • 10

    arCHMage

    A reader and decompiler for files in the CHM format

    arCHMage is a reader and decompiler for files in the CHM format. This is the format used by Microsoft HTML Help, and is also known as Compiled HTML.
    Downloads: 1 This Week
  • 11
    A Python module that provides algorithms for advanced search - basically all you need to build a search engine.
    Downloads: 0 This Week
  • 12
    Atox is a fully customizable Python library and command-line tool for converting plain text into XML.
    Downloads: 0 This Week
  • 13
    DocScript is an approach to document preparation. It presents tools and utilities to edit and publish documents. The philosophy behind the DocScript project is to utilize the programming tools you're working with anyway in your daily work.
    Downloads: 0 This Week
  • 14
    A modular system for extracting and converting Python docstrings into useful structured formats like HTML, XML, and TeX. Project inactive. Development taken over by Docutils, http://docutils.sourceforge.net/.
    Downloads: 0 This Week
  • 15

    Dvipdfm tool for SCons

    SCons tool for working with the dvipdfm program

    SCons is a make replacement providing a range of enhanced features such as automated dependency generation and built-in compilation cache support. SCons rule sets are Python scripts, so in addition to the features it provides itself, SCons lets you use the full power of Python to control compilation. This is a SCons extension (tool) that enables use of the dvipdfm program to convert DVI files to PDF.
    Downloads: 0 This Week
  • 16
    A graphical, hierarchical note-taking application using reStructuredText, Python, and PyQt.
    Downloads: 0 This Week
  • 17
    NovaX is a set of programs being developed for small companies and beginners in the fields of HTML and programming. Coded in Python and C++, it is also intended as a replacement for MS Office. NOTE: This requires Python to be installed on your PC (python.org).
    Downloads: 0 This Week
  • 18
    Python-script syntax hi in Scribus

    Python script syntax highlighting in Scribus

    This Python script for syntax highlighting in Scribus was written for the layout of the magazine "PROgrammer". Previews of archived issues are available at http://raxp.radioliga.com. Scribus has a built-in, incomplete version of the Python interpreter.
    1. First, enter a tag (e.g. [Tem]) at the beginning of the text object.
    2. Then, with the text object still selected, choose "Script -> Run script".
    3. Select the script.
    4. After that, use "Script -> Recent scripts -> Script".
    5. The script reads the text, looks for keywords, comments, and numbers, and then highlights them.
    Downloads: 0 This Week
  • 19
    SchemaDoc is an XML-based markup language for documenting XML schemas. The work products include both the vocabulary and a set of tools for combining it with the schema source (e.g. a DTD) to produce documentation in HTML, XML DocBook, LaTeX, etc.
    Downloads: 0 This Week
  • 20
    SrcComDoc allows documentation written in source comment lines to be extracted, formatted, and highlighted according to the chosen documentation format. The basic SrcComDoc syntax is independent of both the source language and the documentation language.
    Downloads: 0 This Week
  • 21
    TextBlob

    TextBlob is a Python library for processing textual data

    Simple, Pythonic text processing. TextBlob provides a simple API for diving into common natural language processing (NLP) tasks such as part-of-speech tagging, noun phrase extraction, sentiment analysis, classification, translation, and more. TextBlob stands on the giant shoulders of NLTK and pattern, and plays nicely with both. It supports word inflection (pluralization and singularization) and lemmatization, as well as spelling correction, and it comes with a WordNet integration. New models or languages can be added through extensions. If you only intend to use TextBlob's default models (no model overrides), you can pass the lite argument when downloading the corpora. This downloads only those corpora needed for basic functionality. TextBlob is also available as a conda package.
    Downloads: 0 This Week
  • 22
    TransHelp is designed to assist in checking the consistency of Chinese-English translations in a translation project. It is written in PHP and Python. It is especially useful in collaborative translation projects.
    Downloads: 0 This Week
  • 23
    Yet Another Meeting Assistant (YaMA) can help you with minutes of meetings in a simple and efficient manner.
    Downloads: 0 This Week
  • 24
    Codemod is a tool/library to assist you with large-scale codebase refactors that can be partially automated but still require human oversight and occasional intervention.
    Downloads: 0 This Week
  • 25
    csart (Clever-Search-And-Replace-Text): search, find, and replace text in named files or recursively in all directories (-r). Optionally, strings are replaced only if another key string occurs in the same line (-w).
    Downloads: 0 This Week
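
The conditional replacement that the csart entry describes can be illustrated with a minimal Python sketch. This is not csart's own code: the function names `replace_in_text`, `iter_files`, and `replace_in_files`, and the `key` and `recursive` parameters, are hypothetical stand-ins for the behavior the description attributes to the -w and -r options, and the sketch assumes UTF-8 text files.

```python
from pathlib import Path


def replace_in_text(text, old, new, key=None):
    """Replace old with new line by line.

    If key is given, only lines that contain key are changed
    (an assumed reading of the -w behavior described above).
    """
    out = []
    for line in text.splitlines(keepends=True):
        if key is None or key in line:
            line = line.replace(old, new)
        out.append(line)
    return "".join(out)


def iter_files(paths, recursive=False):
    """Yield regular files; directories are walked only when recursive is True."""
    for path in map(Path, paths):
        if path.is_dir():
            if recursive:
                yield from (p for p in sorted(path.rglob("*")) if p.is_file())
        elif path.is_file():
            yield path


def replace_in_files(paths, old, new, key=None, recursive=False):
    """Rewrite each file in place and return the list of files that changed."""
    changed = []
    for path in iter_files(paths, recursive):
        try:
            # newline="" keeps the original line endings untouched.
            with open(path, encoding="utf-8", newline="") as fh:
                original = fh.read()
        except UnicodeDecodeError:
            continue  # skip files that are not UTF-8 text
        updated = replace_in_text(original, old, new, key)
        if updated != original:
            with open(path, "w", encoding="utf-8", newline="") as fh:
                fh.write(updated)
            changed.append(path)
    return changed
```

For example, `replace_in_files(["src"], "foo", "bar", key="TODO", recursive=True)` would rewrite "foo" to "bar" only on lines that also contain "TODO", in every UTF-8 file under the hypothetical directory `src`.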