D&C GLug - Home Page

[LUG] Trainable web site picker...

 

I've just been playing with Audiveris, which is a well cool (showing my age here) 
Java app that takes a sheet music image and converts it to MIDI or MusicXML, 
so someone like me who can't seem to learn to read sheet music can play 
scores.
There are quite a few archives out there with out-of-copyright material 
available, and I'd like to try converting a lot of it to MusicXML.
I'd like to automate the downloading of the images but get rid of the 
detritus.
I want a trainable spider that I can show the 'root' page of the collection, 
click on a table or drop-down list and set that as the repeat action, then go 
down another level to (say) composer level, make a local directory, then 
click through to a song, make a local directory, drill down and get the 
associated image(s), return to composer and get the next song, then back to 
root and get the next part of the collection, and so on.
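
To make the drill-down concrete, here is a rough Python sketch of the walk 
described above. It is not trainable by clicking: instead of a recorded 
action, each level is "taught" by a hand-written regular expression saying 
which links to follow, which is my own simplification. The names crawl, 
LinkCollector, safe_name and fetch_http are made up for this example, it uses 
only the standard library, and it assumes the archive is plain HTML with 
ordinary <a> and <img> tags (no JavaScript, no logins).

```python
import os
import re
from html.parser import HTMLParser
from urllib.parse import urljoin
from urllib.request import urlopen


class LinkCollector(HTMLParser):
    """Collect (href, link text) pairs for <a> tags and src values for <img> tags."""

    def __init__(self):
        super().__init__()
        self.links = []    # list of (href, visible link text)
        self.images = []   # list of image src attributes
        self._href = None  # href of the <a> we are currently inside, if any
        self._text = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag == "a" and attrs.get("href"):
            self._href = attrs["href"]
            self._text = []
        elif tag == "img" and attrs.get("src"):
            self.images.append(attrs["src"])

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text).strip()))
            self._href = None


def fetch_http(url):
    """Default fetcher: return the raw bytes at url."""
    with urlopen(url) as resp:
        return resp.read()


def safe_name(text):
    """Turn link text or a file name into something usable as a local name."""
    return re.sub(r"[^A-Za-z0-9._-]+", "_", text).strip("._") or "unnamed"


def crawl(url, levels, out_dir, fetch=fetch_http):
    """Follow one regex per level; at the bottom level save every image.

    levels is a list of regexes matched against link hrefs, e.g. the first
    picks composer links and the second picks song links (an assumption
    about how a given archive is laid out). Returns the saved file paths.
    """
    parser = LinkCollector()
    parser.feed(fetch(url).decode("utf-8", errors="replace"))
    saved = []
    if not levels:
        # Bottom of the drill-down: keep the images, ignore everything else.
        os.makedirs(out_dir, exist_ok=True)
        for src in parser.images:
            path = os.path.join(out_dir, safe_name(os.path.basename(src)))
            with open(path, "wb") as f:
                f.write(fetch(urljoin(url, src)))
            saved.append(path)
        return saved
    pattern = re.compile(levels[0])
    for href, text in parser.links:
        if pattern.search(href):
            # One local directory per followed link, named after its text.
            sub_dir = os.path.join(out_dir, safe_name(text))
            saved += crawl(urljoin(url, href), levels[1:], sub_dir, fetch)
    return saved
```

Usage would be something like crawl(root_url, [composer_regex, song_regex], 
"scores"), where the two regexes are whatever fits the archive in question. 
The fetch argument is only there so the walk can be tried against canned pages 
before pointing it at a real site; a real run would also want a pause between 
requests and a check of the site's robots.txt.
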
It occurred to me that something like this might also be useful for pulling 
prices from supermarket web sites for a comparison site, as they seem to change 
their arrangements to try and make this difficult: 'Competition? We love it, 
we just do everything we can to stop it...'

Tom te tom te tom


-- 
The Mailing List for the Devon & Cornwall LUG
http://mailman.dclug.org.uk/listinfo/list
FAQ: http://www.dcglug.org.uk/linux_adm/list-faq.html