Hi Khalida,
In a previous message, you mentioned that the speed of your Internet
connection and the storage capacity of your computer were giving you
trouble.
I know this is not directly on-topic on this list, but since you seem to
have tried and exhausted many options already, perhaps you would
consider running your code on a cloud-based computer, such as a server
from Amazon EC2? Doing so would allow you to get around both connection
and storage issues, and perhaps allow you to run your Java code
successfully, or perhaps to run JWPL successfully.
Using EC2 is not particularly simple, but it is technically
straightforward and not too hard, either. The Getting Started Guide is
here:
http://docs.amazonwebservices.com/AWSEC2/latest/GettingStartedGuide/
The pricing is quite inexpensive if you limit the number of hours you
use the servers. There is even a free tier for new users
<http://aws.amazon.com/free/>, although with your needs you might choose
to pay (again, a small amount) for server(s) with larger capacity.
I can offer a small bit of EC2 guidance off-list if you need other
pointers in getting started.
Pete
On 12/2/11 14:03 PM, Khalida BEN SIDI AHMED wrote:
I need an Html dump of Wikipedia because I have
written a java code which
extract text from an html content and I would like to apply it on this
dump. In fact I need to extract the first sentence of a list of articles
(<200) and I don' know how to do it on other dumps. If you have any idea of
other solutions, I will be pleased if you share them with me.