Lowlander wrote: That's finding the lyrics, then you still have to extract them which will most likely be different for each site.
Right, that's what I meant by 'define a pattern to specify where in the body of the resulting page the lyrics themselves would be'. So, each site would have a page structure and have the lyrics themselves in a specific location.
So, for the first site I gave as an example (letras.terra.com) the lyrics seem to be located between <div id="div_letra"> and </div>.
The third site I gave as an example (www.azlyrics.com) seems to have the lyrics between <!-- start of lyrics --> and <!-- end of lyrics -->.
Some sites may not work with such simple rules, but it looks like many would. For example, I just randomly picked a few more lyrics sites:
www.elyrics.net -> lyrics between </small><br><br> and <br><br><br>
www.lyricsmode.com -> lyrics between <div id='songlyrics_h' class='dn'> and <span class="b-lyrics-from-signature">
www.lyricsfreak.com -> lyrics between <div id='content_h' class='dn'> and <span class="b-lyrics-from-signature">
www.songmeanings.net -> lyrics between <div id="textblock" style="z-index: 1;"> and </div>
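
To make the idea concrete, here is a rough sketch in Python of how such per-site rules might be applied. The start/end markers are the ones listed above; the names SITE_MARKERS, extract_between and extract_lyrics are just made up for illustration, and it assumes the page HTML has already been downloaded as a string.

```python
import html
import re

# Start/end markers per site, copied from the list above. They reflect the
# page layouts as observed in this post and could change at any time.
SITE_MARKERS = {
    "letras.terra.com": ('<div id="div_letra">', "</div>"),
    "www.azlyrics.com": ("<!-- start of lyrics -->", "<!-- end of lyrics -->"),
    "www.elyrics.net": ("</small><br><br>", "<br><br><br>"),
    "www.lyricsmode.com": ("<div id='songlyrics_h' class='dn'>",
                           '<span class="b-lyrics-from-signature">'),
    "www.lyricsfreak.com": ("<div id='content_h' class='dn'>",
                            '<span class="b-lyrics-from-signature">'),
    "www.songmeanings.net": ('<div id="textblock" style="z-index: 1;">',
                             "</div>"),
}


def extract_between(page, start, end):
    """Return the text between the first `start` and the next `end`, or None."""
    begin = page.find(start)
    if begin == -1:
        return None
    begin += len(start)
    stop = page.find(end, begin)
    if stop == -1:
        return None
    return page[begin:stop]


def extract_lyrics(site, page):
    """Hypothetical helper: apply the site's marker pair, then tidy the HTML."""
    markers = SITE_MARKERS.get(site)
    if markers is None:
        return None  # no rule defined for this site
    raw = extract_between(page, *markers)
    if raw is None:
        return None  # markers not found; the page layout may have changed
    text = re.sub(r"<br\s*/?>", "\n", raw, flags=re.IGNORECASE)  # <br> to newline
    text = re.sub(r"<[^>]+>", "", text)  # drop any remaining tags
    return html.unescape(text).strip()
```

Adding a new site would then just mean adding one more entry to the table. One caveat: for rules that end at </div>, this stops at the first closing div it finds, so a nested div inside the lyrics block would cut the result short.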