Hey, I'm doing a music recommendation software (and learning TCL in the process) but I want to filter the submissions to avoid garbage. Anyone knows a way to do so? The filtering function would take an artist/album couple and check if it exists. Would be better if it could be done locally, but I somehow doubt it's possible to get such a free database.
>>1576 Tcl is a very nice language, good luck on learning it. Libraries are generally bad though, many are ancient, broken or incompatible.
As for filtering garbage, I'd recommend first adding a simple captcha to stop people from using bots to spam. Even if this isn't a part of your threat model, it is very useful to at least have the option available.
Secondly, you could scrape wikipedia for lists of artists and their albums. This would work extremely well for the popular ones at the very least.
Scraping wikipedia is also quite easy since they have lists of shit as articles in addition to a downloadable archive of theif text.
You will need to parse them into a condensed format for more speed, which shouldn't be hard.
>>1577 Wikipedia is pretty poor on content, though. I'd probably have more luck with discogs or last.fm
I'll do it in TCL to learn it, but the final software will probably be in C, since the algorithm uses a graph with a node per album and needs both an adjacency matrix and adjacency lists.
Hey, I'm doing a music recommendation software (and learning TCL in the process) but I want to filter the submissions to avoid garbage. Anyone knows a way to do so? The filtering function would take an artist/album couple and check if it exists. Would be better if it could be done locally, but I somehow doubt it's possible to get such a free database.