Stars
5
Forks
8
Language
PHP
Last Updated
Mar 04, 2021
Similar Repos
Repo | Language | Stars | Description | Updated At |
---|---|---|---|---|
XSLT | 9 | odt ► TEI, extract semantics from offices texts to XML/TEI | Jul 28, 2022 | |
TeX | 2 | LaTeXML post-processing target for the DOC and ODT formats | Feb 10, 2015 | |
Java | 20 | Text conversion tool (from e.g. Word, HTML, txt) to corpus formats TEI or FoLiA) | Aug 11, 2022 | |
Rich Text Format | 1272 | :black_nib: Word Processing Document Library | Aug 11, 2022 | |
Python | 13 | Desktop document generation (.odt, .pdf, .doc, ...) based on appy framework (http://appyframework.org) and OpenOffice/LibreOffice | May 07, 2022 | |
TeX | 4 | Converters between doc, odt, latex, html and xml | May 08, 2021 | |
Perl | 2 | Conversion tools to and from the TEITOK TEI/XML format | Jan 28, 2023 | |
Ruby | 478 | Read text and metadata from files and documents (.doc, .docx, .pages, .odt, .rtf, .pdf) | Oct 05, 2022 | |
PHP | 4 | Extract text from a Word Doc | Jun 01, 2021 | |
XQuery | 6 | The core server-side libraries of TEI Publisher, implementing the TEI Processing Model | Aug 08, 2022 | |
C# | 15 | Generate report from Word document | Mar 09, 2023 | |
TeX | 3 | Conversion of complete letters by Descartes from legacy ascii to XML-TEI | Nov 26, 2022 | |
XSLT | 22 | Ruby gem that converts an HTML page/document into a Microsoft Word `.doc` file | Jul 20, 2022 | |
JavaScript | 3 | extract plain text from a word document | Nov 09, 2013 | |
Ruby | 9 | Extract text contents from Microsoft Word Document | May 13, 2022 | |
Scala | 2 | XML processing with Scala XML, DOM and SAX; DocBook processing and Gradle plugin; TEI and … | Feb 11, 2023 | |
None | 3 | Test server that will serve word document macro templates. | May 10, 2023 | |
C | 524 | Word Mover's Distance from Matthew J Kusner's paper "From Word Embeddings to Document Distances" | Feb 15, 2023 | |
None | 11 | edit and preview .doc or .docx document online. | Jun 08, 2022 | |
Dart | 11 | a word document .docx template plugin to easily populate and generate word documents from templates | Jul 01, 2022 | |
C# | 61 | GemBox.Document is a .NET component that enables you to read, write, convert, and print document … | Apr 11, 2023 | |
Haskell | 5 | Textual case conversion and word boundary manipulation | Jan 03, 2022 | |
Shell | 2 | Custom script to convert odt (and any supported OpenOffice Text document) into mardown | Nov 08, 2022 | |
Python | 114 | Finding document vectors from pre-trained word2vec word vectors | Nov 20, 2021 | |
Shell | 24 | Some useful Nemo Actions and Shell Scripts with zenity GUIs: 1. Sandwich PDF Maker (OCR, … | Aug 19, 2022 | |
Visual Basic .NET | 4 | Use the Word Processing Document API to manage rich text documents in code. | Jun 18, 2022 | |
Python | 2 | A fast trainer/utilities for unsupervised learning of word and doc embeddings from raw text | Oct 14, 2017 | |
C# | 93 | Export to Office(Excel,Word) , Pdf,OpenDocumentFormat( ODS,ODT )from Classes/DataSet/DataTable/IDataReader/JSON/CSV/RSS/ | Mar 09, 2023 | |
JavaScript | 8 | [ARCHIVED] Learn how to load docx files from a NodeJS server and insert them into … | Jul 01, 2022 | |
Kotlin | 78 | This library reads word documents (.doc and .docx), txt and PDF files, and gives the … | Apr 13, 2023 | |
Python | 2 | Tools for word and document embedding using UMAP | May 29, 2021 | |
Jupyter Notebook | 2 | This is a repository that takes as input a word doc or pdf and identifies … | Apr 04, 2023 | |
PHP | 43 | Digital Format Conversion tools enable conversion from Microsoft Word (2007+) DOCX format or HTML to … | Aug 04, 2022 | |
Java | 522 | A standalone Java library/command line tool that converts DOC, DOCX, PPT, PPTX and ODT documents … | Oct 15, 2022 | |
Python | 27 | Custom recipe and utilities for document processing | Aug 01, 2022 | |
Shell | 2 | 📄 Add citations and pdfs from the CERN Document Server | Jan 30, 2023 | |
Python | 2489 | Top2Vec learns jointly embedded topic, document and word vectors. | Apr 11, 2023 | |
Python | 42 | Lbl2Vec learns jointly embedded label, document and word vectors to retrieve documents with predefined topics … | Jul 16, 2022 | |
Go | 72 | Extract text from plaintext, .docx, .odt and .rtf files. Pure go. | Oct 04, 2022 | |
Python | 8 | XML-RPC document conversion server ( Mirror of https://lab.nexedi.com/nexedi/cloudooo ; Please submit patches and issues there … | Jun 07, 2022 | |
Python | 7 | Headless document conversion and printing using LibreOffice or Microsoft Office | Jul 21, 2022 | |
Python | 16 | CSV processing and web related data types mutual conversion | Aug 19, 2022 | |
TypeScript | 2 | View and edit Word document in Angular application without Microsoft Word or Office interop dependencies. | Apr 13, 2022 | |
Vue | 2 | View and edit Word document in Vue application without Microsoft Word or Office interop dependencies. | Jun 14, 2022 | |
Rust | 100 | 🦜 Accessible image processing and conversion from the terminal. Front-end for image-rs/image. | Aug 06, 2022 | |
JavaScript | 737 | Fast and simple report generator, from JSON to pdf, xslx, docx, odt... | Jul 08, 2022 | |
C# | 5 | Find and replace text in a Word document in C# and VB.NET without Microsoft Word … | Sep 06, 2021 | |
Python | 16 | Using Centroids of Word Embeddings and Word Mover's Distance for Biomedical Document Retrieval in Question … | May 27, 2021 | |
Python | 5 | Code for "Unsupervised Abstractive Dialogue Summarization with Word Graphs and POV Conversion" | Dec 02, 2022 | |
Python | 2 | Dump Excel sheets or Word document text and tables as text. | Jul 02, 2021 |