zip (was: Re: Disk archival techniques)

Dwight K. Elvey dwight.elvey at amd.com
Thu May 19 13:12:15 CDT 2005


>From: "Vintage Computer Festival" <vcf at siconic.com>
>
>On Thu, 19 May 2005, Dwight K. Elvey wrote:
>
>>  In any case, these are all academic in comparison to the problems
>> of indexing. I don't even have the beginings of how to deal
>> with that problem.
>
>Google :)
>

Hi
 It works surprisingly well but it still misses a lot.
Like when I was looking for the data sheets of the WD1100V-01.
The information was out there, it just wasn't indexed.
Most document writing programs today have that automatic
indexing by marking things as you go along to place in
the index. It requires that someone actually realizes
what needs to be indexed. Then comes the problem of cross
references. Add to that synonyms.
 I was looking through the directories of one of the images
I'd captured from the Polymorphic stuff and found that
a disk labeled "GAMES" contained a version of Forth.
That may have been the persons personal feelings
about it but it was not good indexing.
 My guess is that Google is missing 90 to 95% of the
relevant information out there. If you include site links
on individual pages that improves to about 85% at best.
 Now, add to that the problem of something that exist
but gets somehow placed in the wrong place.
 Indexing will be the biggest challenge!
Dwight




More information about the cctalk mailing list