Subject | Re: [Firebird-general] Re: Wikipedia and Firebird? |
---|---|
Author | Lester Caine |
Post date | 2005-02-24T08:16:37Z |
Artur Anjos wrote:
the way wikipedia have structured the data I would say that it needs
major surgery to make it much more manageable given the large volume of
data.
It looks like a nice source of raw data for playing with Full Text
Search though ;)
--
Lester Caine
-----------------------------
L.S.Caine Electronic Services
>>I'm not 100% sure, but I assume for everything but size andI've got rsync running in the background at the moment, but looking at
>>performance testing, it would be sufficent to start with the database
>>dump of a single language Wikipedia. For example the dumps of the
>>Romanian Wikipedia are only 63MB, see:
>>
>>http://download.wikimedia.org/archives/ro/
>
> If someone does the job to port Wikipedia, I can download all the files
> and send it in a CD to anywhere in the world.
> There are alternative ways to send files :-))
>
> Next, we need someone to host this somewhere.
the way wikipedia have structured the data I would say that it needs
major surgery to make it much more manageable given the large volume of
data.
It looks like a nice source of raw data for playing with Full Text
Search though ;)
--
Lester Caine
-----------------------------
L.S.Caine Electronic Services