How can I extract the text content of PDF and PPT files? It needs to support Chinese. Thank you!

2011-11-02 06:00:14

The plug-ins I have found only support English. I would like to find one that supports Chinese. Has any expert here done this before?
I would appreciate a pointer or two!

2011-11-02 06:15:02
There is an API for this; try searching for it.
2011-11-02 06:18:40
Ha, that is ambitious! Something that can generate PDFs and can also extract content from them? Unlikely.
You could take a look at the Apache POI project, but for PPT I would guess it is unlikely too!
2011-11-02 06:24:23
Extracting content from Word documents can be done with POI, and Chinese is no problem; we have tried it. PPT should be possible in the same way.
As for PDF, I do not know. Bumping the thread for you.
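
For readers who want to check that Chinese text survives extraction, here is a minimal sketch that uses only the Java standard library. It is not the POI approach mentioned above, and it assumes the newer .pptx format (a zip package whose slide XML keeps text runs in DrawingML <a:t> elements); the older binary .ppt format needs a library such as Apache POI instead. The class name PptxTextExtractor and the method extractSlideText are hypothetical names chosen for illustration.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.util.ArrayList;
import java.util.List;
import java.util.zip.ZipEntry;
import java.util.zip.ZipInputStream;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.parsers.ParserConfigurationException;
import org.w3c.dom.Document;
import org.w3c.dom.NodeList;
import org.xml.sax.SAXException;

// Hypothetical helper (not part of any library): collects text runs from .pptx slides.
public class PptxTextExtractor {
    // DrawingML namespace; slide text runs are stored in <a:t> elements.
    private static final String DRAWINGML_NS =
            "http://schemas.openxmlformats.org/drawingml/2006/main";

    public static List<String> extractSlideText(InputStream pptx)
            throws IOException, ParserConfigurationException, SAXException {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        factory.setNamespaceAware(true); // needed to match <a:t> by namespace
        List<String> runs = new ArrayList<>();
        try (ZipInputStream zip = new ZipInputStream(pptx)) {
            ZipEntry entry;
            while ((entry = zip.getNextEntry()) != null) {
                String name = entry.getName();
                // Assumption: slides live at ppt/slides/slideN.xml inside the package.
                if (!name.startsWith("ppt/slides/slide") || !name.endsWith(".xml")) {
                    continue;
                }
                // Copy the entry into memory so the XML parser cannot close the zip stream.
                ByteArrayOutputStream buffer = new ByteArrayOutputStream();
                byte[] chunk = new byte[8192];
                int read;
                while ((read = zip.read(chunk)) != -1) {
                    buffer.write(chunk, 0, read);
                }
                // The parser honours the encoding declared in the XML (normally UTF-8),
                // so Chinese characters are decoded without extra handling.
                Document doc = factory.newDocumentBuilder()
                        .parse(new ByteArrayInputStream(buffer.toByteArray()));
                NodeList nodes = doc.getElementsByTagNameNS(DRAWINGML_NS, "t");
                for (int i = 0; i < nodes.getLength(); i++) {
                    runs.add(nodes.item(i).getTextContent());
                }
            }
        }
        return runs;
    }
}
```

Calling PptxTextExtractor.extractSlideText(new FileInputStream("slides.pptx")) returns the text runs in the order the slide entries appear in the zip, which is not guaranteed to match presentation order. When printing the result, use a UTF-8 output stream so the Chinese text is not mangled by the console's default encoding.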
2011-11-02 06:32:30
For PPT: create the PowerPoint object from VB or VC and use it to convert the PPT to RTF, then use the Word object to convert the RTF to TXT. After that,
call it from Java. I have done this successfully.
As for PDF, it is a real headache ... if anyone knows how, please tell me.
