PyWeb-IL Presentation on Harvesting: Finding the Most Influential Artists

Yesterday I gave a presentation on harvesting to the PyWeb-IL group. In the presentation, I described what I learned about harvesting and also gave a concrete example of how to find the “most influential artists” using data from allmusic.com and a (very) naive implementation of PageRank. The PageRank implementation was based on wikipedia word-by-word, and … Continue reading PyWeb-IL Presentation on Harvesting: Finding the Most Influential Artists