Java open source utilities

Bulldog13

Golden Member
Hello,

I am looking for Java programs (with source) that do the following:

Convert Word to text from Java
Convert HTML to text with correct layout and spacing
Call Visual Basic or C++ from Java
~Partly found here, but I still need help with calling VB: http://www.javaworld.com/javaw...tips/jw-javatip17.html
Convert PDF to text from Java
~Found on sourceforge.net: PDFBox
Convert PowerPoint to text from Java

I am working on a project, and we are doing a feasibility analysis of certain features, so that's why I need open source code.
 
Well, I'm not sure what you mean by "to text". Do you mean an ASCII text file?

You might want to check out OpenOffice. It's an open source office suite that can read (and write?) Microsoft Office file formats. The source code may give you an idea of how complex the Word and PowerPoint formats are.
 
By text I mean stripping the HTML, Java, and img tags while preserving the text formatting as much as possible.

Kind of like what a website *might* look like if you opened it with Notepad.

Yes, ASCII text file.
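
A minimal sketch of that kind of tag stripping, in case it helps with the feasibility check. It assumes a crude regex-based approach is good enough for a first look. The class name HtmlToText and the method toText are made up for illustration, only a handful of entities are decoded, and this is not a real HTML parser, so malformed or unusual markup will likely trip it up.

```java
import java.util.regex.Pattern;

public class HtmlToText {
    // Hypothetical helper: crude regex-based stripping, not a real HTML parser.
    private static final Pattern SCRIPT_OR_STYLE =
        Pattern.compile("(?is)<(script|style)[^>]*>.*?</\\1\\s*>");
    private static final Pattern LINE_BREAK =
        Pattern.compile("(?i)<br\\s*/?>");
    private static final Pattern BLOCK_END =
        Pattern.compile("(?i)</(p|div|h[1-6]|li|tr)\\s*>");
    private static final Pattern ANY_TAG = Pattern.compile("(?s)<[^>]+>");

    public static String toText(String html) {
        // Drop script and style elements together with their contents.
        String s = SCRIPT_OR_STYLE.matcher(html).replaceAll("");
        // Turn <br> and the end of common block elements into newlines
        // so that some of the layout survives.
        s = LINE_BREAK.matcher(s).replaceAll("\n");
        s = BLOCK_END.matcher(s).replaceAll("\n");
        // Remove every remaining tag (img, b, a, applet, ...).
        s = ANY_TAG.matcher(s).replaceAll("");
        // Decode a few common entities; &amp; goes last to avoid double decoding.
        s = s.replace("&nbsp;", " ").replace("&lt;", "<").replace("&gt;", ">")
             .replace("&quot;", "\"").replace("&amp;", "&");
        // Tidy whitespace: collapse spaces and tabs, cap consecutive newlines at two.
        s = s.replaceAll("[ \\t]+", " ");
        s = s.replaceAll(" ?\\n ?", "\n");
        s = s.replaceAll("\\n{3,}", "\n\n");
        return s.trim();
    }

    public static void main(String[] args) {
        String html = "<html><body><h1>Title</h1><p>Hello <b>world</b>!<br>"
            + "<img src=\"a.png\">Second line</p>"
            + "<script>var x = 1;</script></body></html>";
        System.out.println(toText(html));
    }
}
```

Running main prints the heading, "Hello world!" and "Second line" on three separate lines, with the img and script content gone. For anything beyond a quick test, a proper HTML parser would be the safer choice.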
 