Skip to main content

All Questions

Tagged with
13 votes
2 answers
22k views

How to convert HTML to text?

How it is possible to convert HTML to text file in Linux? For example I want to curl a query to Google, then convert the output html to text and read converted text on my terminal. I am using RHEL6.
rivu's user avatar
  • 271
8 votes
3 answers
6k views

HTML to SVG conversion?

I would like to convert some somewhat straightforward web pages (no javascript, minimal CSS) into SVG for archiving. I am wondering if there is a suggested tool or workflow for this conversion? My ...
jedierikb's user avatar
  • 529
7 votes
1 answer
1k views

Batch convert Microsoft Word files to HTML files

How can Microsoft Word files (.doc, .docx) be converted to HTML files as a batch process running on a Linux server?
z12345's user avatar
  • 193
4 votes
2 answers
925 views

Semantic PDF to HTML conversion

I would like to convert a PDF document to a collection of HTML pages that exhibit 'clean' markup, and generate/keep semantic info (chapters, sections...), as well as perform cleanup tasks (e.g. I am ...
Rom1's user avatar
  • 161
3 votes
8 answers
6k views

PDF to HTML - batch converter - most reliable and accurate free AND paid for software? [closed]

I'm look for either a free or paid-for (about 50$/40pounds) BATCH PDF to HTML converter to convert several PDF files at once. Needs to be able to handle vectored and bitmap images within the file, ...
therobyouknow's user avatar
3 votes
2 answers
4k views

batch convert htm files to pdf

What applications allow the user to batch convert HTM files into PDF?
dreftymac's user avatar
  • 425
3 votes
1 answer
1k views

Pandoc options to convert LaTeX flavored Markdown to HTML

The aim is to convert a Markdown file (file.md) to a html file (file.html). The Markdown file contains formulas specified in LaTeX specified between two dollar ($) signs. Example: The resulting ...
willeM_ Van Onsem's user avatar
3 votes
5 answers
3k views

How can I convert HTML to pdf?

I want to read and annotate internet articles like books on my iPad so I would like to convert HTML to PDF. Is there a way of doing this that preserves every font as is can make PDF out of selection ...
kissgyorgy's user avatar
3 votes
1 answer
5k views

Word to HTML Conversion- Loss of image quality

I have a word document which has formulas and other things as images.Now when I convert this document to a HTML file (Save as Webpage) the images experience a loss in quality. This is negligible in ...
Wang Liqin's user avatar
3 votes
1 answer
2k views

Convert .doc or .rtf to clean HTML on OS X

When I export a file from Word or TextEdit, I get very bloated HTML, full of crazy style tags on every paragraph, so I can't even clean it by hand. The only information I want preserved is: <h1&...
iDontKnowBetter's user avatar
2 votes
8 answers
17k views

How do I convert Word files or HTML to CHM or PDF?

I have a piece of software that currently packages an MS Word file as the user guide/help. I would like to make this into either a PDF or a CHM file. I do not wish to re-write the help or user ...
tim's user avatar
  • 282
2 votes
5 answers
2k views

How can I convert a large number of Word documents to HTML as fast as possible?

I have to convert 500 Microsoft Word 2003 files into HTML documents. What would be the shortest possible way? I'm not just talking about extension .doc to HTML. I want to convert word files's data ...
metal gear solid's user avatar
2 votes
2 answers
17k views

Batch convert all html files in a directory into pdf files in windows? [closed]

I see one solution so far: http://www.htmldoc.org/ Are there any more out there which are suggested?
Tal Galili's user avatar
  • 3,395
2 votes
3 answers
5k views

Batch convert .doc files to .txt (plain ascii text) and/or .html recursively in folders and subfolders, Windows and Mac?

Is there a tool to do this. I've seen some Python/Java tools to automate OpenOffice but has anyone reliably scripted this to do more than one file, and recurse through a folder/directory tree with ....
therobyouknow's user avatar
2 votes
3 answers
25k views

Convert a Entire Folder To PDF

I want to turn an entire folder(R Tutorial) that is a web site(written in HTML only) that I've already downloaded all the tree to my computer, but I want to turn this into a single PDF. Someone knows ...
Nathan Campos's user avatar
2 votes
2 answers
2k views

Prevent Excel from converting values to dates when opening HTML files

I have some HTML files that contain tables, which I need to perform some analysis on. I can open them in Excel, and it preserves all the table formatting and layout (which is what I want). The ...
Some_Guy's user avatar
  • 774
2 votes
3 answers
2k views

Convert html2pdf with toc, color and unicode support?

Is there a way I can convert large html file (produced with sphinx by the way) to pdf with color, table of contents (toc) and unicode support? There's htmldoc -- but it neither support color, nor ...
Adobe's user avatar
  • 2,849
2 votes
1 answer
928 views

Is it possible to convert an EPUB file to HTML? [closed]

There are many questions and answers for reverse direction of converting HTML to EPUB but what about converting an EPUB file to HTML? There are many tools that can extract CHM (Microsoft Compiled HTML ...
imida k's user avatar
  • 219
2 votes
0 answers
56 views

Convert HTML pages to PDF, but after applying ublock origin filters

I want to convert a bunch of webpages to PDF using wkhtmltopdf. But I want the uBlock Origin personal filtering under My Filters (currently being used in my browser) be applied on each HTML page. How ...
user13107's user avatar
  • 303
1 vote
1 answer
5k views

How to make all table borders invisible in MS Word after copying from HTML

I am in a situation where I need to make a HTML report into a word report with nothing more that Ctrl+C or opening it with Word. I end up with a lot of nested tables. Problem lies in the fact that ...
TheBW's user avatar
  • 341
1 vote
1 answer
1k views

How to convert HTML tags to RTF or any rich format text from the Linux command line

How can I convert HTML tags to rtRTF or any rich format text using sed or any linux command-line tool? I've achieved to strip them with sed 's/<[^>]*>//g', but I need the <b>hi</b&...
aemonge's user avatar
  • 121
1 vote
2 answers
5k views

Chrome extension to change HTML tags automatically when loading pages?

Is there any Chrome extension that intercepts a page loading and converts specific tags before showing the final page? For example, if some page contains <br> and I want it to be <p> ...
Rogério Dec's user avatar
1 vote
1 answer
3k views

Looking for tex to html converter

I need to convert a very large latex project (made up of many .tex and style files) into .html (or something similarly non-.pdf). Can someone recommend a quality converter program? Preferably, one ...
Stephen's user avatar
  • 311
1 vote
1 answer
755 views

How to transform html ebooks into pdf ebooks for mac osx

There is an ebook format that basically consists of a bunch of html pages, one for each book page, all in a folder, and an html page outside this folder called start_here.html . This is easy to read ...
Pietro Speroni's user avatar
1 vote
1 answer
225 views

Program to read and convert entire website to html [duplicate]

Possible Duplicate: How can I download an entire website I'm using a very old cms, Synkron.web. I need to convert all of my pages (written in .asp) to plain html - so I have an archive where all ...
Frederik Wordenskjold's user avatar
0 votes
2 answers
202 views

Convert PDF files to linked HTML files

Does anyone know of a way to convert a PDF file to HTML files, 1 file per page? If the pages can be linked with each other, that is, page 10 contains links to pages 9 and 11, for easier browsing, ...
Waleed Hamra's user avatar
0 votes
1 answer
324 views

Convert many HTML files into several PDF files [closed]

Is there a tool or a program for Windows with which I can convert a lot of HTML files (approx. 3000) into individual PDF files? I can only find requests for a single PDF everywhere, but I need to ...
YL73's user avatar
  • 11
0 votes
1 answer
34k views

How to save full webpage as a pdf with images, etc, in a muti page format?

So, I am trying to take my e-portfolio and make a pdf version because my uni is playing hot potatoe with the website keeping my portfolio. I wanted links preserved as well as documents, but in the end,...
Sammy-Jo Watt's user avatar
0 votes
1 answer
92 views

Stripping and reformatting specific HTML tags from content

I'm currently working on a study involving Stack Exchange content and trying to find an efficient way to bring the content into my CAQDAS. The CAQDAS I'm using is DeDoose. My issue is that the ...
curious's user avatar
  • 161
0 votes
1 answer
2k views

Convert html, pdf or .doc format to google doc without breaking formatting

I'm building some documents which need to live online on google docs. I start off by building static html files using Ruby on Rails. These look fine. Then I convert them to a pdf using wkhtmltopdf. ...
Max Williams's user avatar
  • 3,039

15 30 50 per page