Questions tagged [text-processing]
Software related to processing text in any way, e.g. formatting, filtering, searching and replacing, etc.
157 questions
1 vote, 1 answer, 41 views
When awk becomes too cumbersome, what is the next classic Unix tool to consider to deal with text transformation?
Awk is invaluable for many purposes where text filter logic spans multiple lines and you need to maintain state (unlike grep and sed), but as I'm finding lately there may be cases where you need ...
2 votes, 0 answers, 12 views
Software/web service that can extract quoted text and attributions from fiction
I need a tool that can read through a (potentially long) work of fiction (e.g. a short story or novel) and find instances where a character speaks and provide as output the name of the ...
2 votes, 0 answers, 18 views
CPU-based LLM for Markdown editor
I'd like to integrate an LLM into my free, open-source, cross-platform desktop Markdown editor. In particular, I'd like to bundle the LLM into the editor's ancillary podman container (which is used for typesetting)....
0 votes, 0 answers, 6 views
Math accessibility for math kids on Android via Gboard (or other comfortable math keyboard) via plugins and software engineering
I tried posting this on the TeX - LaTeX stack exchange, but couldn't because I kept getting an error (I couldn't identify the error).
I am looking for a predictive system I want to use on Android to ...
0 votes, 0 answers, 54 views
Blur pdf document
I am looking for a tool to blur a PDF document as shown below; so far I have found only online and offline tools to black out text (e.g. Adobe, smallpdf, redactpdf, pdffiller).
0 votes, 0 answers, 198 views
Unblur pdf document
I have a PDF document in which some pages are blurred (see the figure below). I tried to unblur it using ScanWritr and AvePDF, but without success.
Could you suggest a tool to do that, please?
0 votes, 0 answers, 13 views
Dictation: computer speaks, learner writes
I am wondering whether there is any software on the Android Google Play store that does the following:
Text is read out
The software waits for the text that is read out to be written at the keyboard....
2 votes, 1 answer, 66 views
Is there a (command line) tool to calculate percentage change between two files of the same format full of numbers?
Is there a (command line) tool to calculate percentage change between two files of the same format full of numbers?
Example:
File A:
100 100
File B:
150 70
Invocation:
$ calc_perc_change A B
+50% -...
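A minimal sketch of the calculation being asked about, assuming both files hold whitespace-separated numbers in the same layout. The function name `percent_changes` is hypothetical (it is not an existing tool, and neither is the `calc_perc_change` command in the question), and the file contents are passed in as strings to keep the example self-contained.

```python
def percent_changes(text_a: str, text_b: str) -> list[list[str]]:
    """Return signed percentage changes from A to B, value by value.

    Assumes (hypothetically) that both inputs have the same number of
    lines and the same number of whitespace-separated numbers per line.
    """
    result = []
    for line_a, line_b in zip(text_a.splitlines(), text_b.splitlines()):
        row = []
        for old, new in zip(line_a.split(), line_b.split()):
            old_value, new_value = float(old), float(new)
            if old_value == 0:
                # A percentage change from zero is undefined.
                row.append("n/a")
            else:
                change = (new_value - old_value) / old_value * 100
                row.append(f"{change:+.0f}%")
        result.append(row)
    return result


# Contents of File A and File B from the example above.
print(percent_changes("100 100", "150 70"))  # [['+50%', '-30%']]
```

To run this against real files, the two strings could be read with `pathlib.Path("A").read_text()` and `pathlib.Path("B").read_text()`.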
0 votes, 0 answers, 12 views
Overwrite handwriting on Android (input to word processor, math-aware)
I am trying to save some trees.
To do this, I would like to develop a system for true handwriting that can be used on Android.
For that to work, it would have to be an overwrite ...
0 votes, 0 answers, 22 views
Text classification: Elastic or Apache Spark?
I would like to train a text classification model to analyze the municipal council resolutions of cities.
A resolution usually deals with: housing, sanitation, local public transportation...
Each ...
2 votes, 0 answers, 36 views
Extracting font from a handwritten pdf file
I'm wondering if it's possible to extract (or rather, create) a font from a handwritten PDF file. Online, I found only methods for creating a font from my own handwriting.
There are projects like ...
1 vote, 0 answers, 16 views
Looking for tools to summarize or quiz from large PDF/text files
The Claude AI chatbot can summarize PDFs and text files, but it has a length limit. Usually, if one wants to summarize a book, the PDF/text file has more words than the limit ...
0 votes, 1 answer, 74 views
OCR extraction from scans
I really need your help!
I have been tasked by my university to digitise a whole bunch of historical data. These are old scans from books and my task would be to copy the data by hand.
Since this task ...
0 votes, 0 answers, 13 views
open source textual preprocessor recommendation (GPL compatible)
I am looking for an open source (GPLv3+ compatible) textual preprocessor for Linux (for the RefPerSys inference engine project).
It is needed to generate input for GNU bison.
FYI, GNU ...
0 votes, 0 answers, 26 views
Analysing the meter used in poetry
I'm looking for a tool that can take a poem, presumably in the form of a text file, and annotate it with useful information about the meter used in the poem, such as
Number of lines in each ...