Headache: Making an HTML document into a Text document
Posted on 8:53 pm by Andrew WilsonContent for our sites comes in all sorts of formats and types, often a whole folderfull at once, and all to go on a website as soon as possible.
Today I needed to import a load of articles into a site, but they had to be changed a little - 120 or so articles. The articles were all HTML pages, I needed to insert some non standard characters and then remove all the HTML tags and save as text files. The very thought of doing this by hand gave me a cold sweat!
I had no tools for the job, but I was lucky, Google led me to NoteTab a text editor that is almost as useful in its free version as its not very expensive paid for version. With NoteTab I was able to open all 120 files in the folder at one go, strip out all the HTML tags, but keep all the URLs intact and insert my special characters. All the tags were stripped in a couple of seconds and the documents saved as text files in the same folder as they had come from. Once I had discovered how to do the job it took perhaps a minute from start to finish to do what would have taken perhaps a couple of hours or more by hand.
This is a very useful tool, for free and just $19.95 to upgrade. Bargain!




