The hash cache is too inflexible, in its current on-disk format. It was fine in the past,
where Apache included the ocntents into v3 Metalinks. The snippets on disk were prepared
just for that. However, it's difficult to add further features like
- hashes in HTTP headers
- inclusion of hashes into RFC Metalinks (different format)
- inclusion of hashes into the mirror lists
- building a "hash server" (append .md5 to any URL and get the md5 sum)
So this is blocking several good things that could be done.
Issue 15 contains some ramblings about this, but let's track this change here.
I currently think that moving the hash into the database might be best. It would definitely
a flexible option without the need to invent an on-disk format and write parsers for it.
Also, it would make the data available to a web frontend.
Before the on-disk format is dropped, we can try how well it works with the database.
As a first step, I have now transferred all functionality from the external metalink-hasher
script into the "mb" tool. Thus, now the database functionality is available for no cost.
|