All Questions

2 votes
0 answers
56 views

Convert HTML pages to PDF, but after applying ublock origin filters

I want to convert a bunch of webpages to PDF using wkhtmltopdf, but I want my uBlock Origin personal filters under My Filters (currently in use in my browser) to be applied to each HTML page. How ...
0 votes
1 answer
92 views

Stripping and reformatting specific HTML tags from content

I'm currently working on a study involving Stack Exchange content and trying to find an efficient way to bring the content into my CAQDAS. The CAQDAS I'm using is DeDoose. My issue is that the ...
0 votes
1 answer
2k views

How to convert large .htm file to .pdf?

I have a fairly large .htm file (100 MB) that doesn't fully load in any browser I've tried (the page stops rendering after a certain point), so I want to try converting the file into a .pdf file so ...
1 vote
1 answer
225 views

Program to read and convert entire website to html [duplicate]

Possible Duplicate: How can I download an entire website? I'm using a very old CMS, Synkron.web. I need to convert all of my pages (written in .asp) to plain HTML - so I have an archive where all ...
3 votes
1 answer
2k views

Convert .doc or .rtf to clean HTML on OS X

When I export a file from Word or TextEdit, I get very bloated HTML, full of crazy style tags on every paragraph, so I can't even clean it by hand. The only information I want preserved is: <h1>...
8 votes
3 answers
6k views

HTML to SVG conversion?

I would like to convert some somewhat straightforward web pages (no javascript, minimal CSS) into SVG for archiving. I am wondering if there is a suggested tool or workflow for this conversion? My ...
2 votes
2 answers
2k views

Prevent Excel from converting values to dates when opening HTML files

I have some HTML files that contain tables, which I need to perform some analysis on. I can open them in Excel, and it preserves all the table formatting and layout (which is what I want). The ...
3 votes
1 answer
1k views

Pandoc options to convert LaTeX flavored Markdown to HTML

The aim is to convert a Markdown file (file.md) to an HTML file (file.html). The Markdown file contains formulas written in LaTeX between two dollar ($) signs. Example: The resulting ...
4 votes
2 answers
928 views

Semantic PDF to HTML conversion

I would like to convert a PDF document to a collection of HTML pages that exhibit 'clean' markup, and generate/keep semantic info (chapters, sections...), as well as perform cleanup tasks (e.g. I am ...
2 votes
3 answers
5k views

Batch convert .doc files to .txt (plain ascii text) and/or .html recursively in folders and subfolders, Windows and Mac?

Is there a tool to do this? I've seen some Python/Java tools to automate OpenOffice, but has anyone reliably scripted this to do more than one file, and recurse through a folder/directory tree with ....
2 votes
2 answers
17k views

Batch convert all html files in a directory into pdf files in windows? [closed]

I see one solution so far: http://www.htmldoc.org/ Are there any more out there which are suggested?
0 votes
2 answers
202 views

Convert PDF files to linked HTML files

Does anyone know of a way to convert a PDF file to HTML files, 1 file per page? If the pages can be linked with each other, that is, page 10 contains links to pages 9 and 11, for easier browsing, ...
0 votes
1 answer
819 views

Convert Large HTML to DOC?

I am programmatically converting an existing HTML file that has been dynamically created to an MS Word document. I've run into a situation where the MS Word Document object fails to process the HTML ...
3 votes
8 answers
6k views

PDF to HTML - batch converter - most reliable and accurate free AND paid for software? [closed]

I'm looking for either a free or paid-for (about $50/£40) batch PDF to HTML converter to convert several PDF files at once. It needs to be able to handle vector and bitmap images within the file, ...
1 vote
1 answer
5k views

How to make all table borders invisible in MS Word after copying from HTML

I am in a situation where I need to turn an HTML report into a Word report with nothing more than Ctrl+C or opening it with Word. I end up with a lot of nested tables. The problem lies in the fact that ...