This repository has been archived by the owner on Mar 9, 2021. It is now read-only.

Fixes pagination detection in Tumblr likes.
Fixes the pagination detection in the Tumblr likes crawler for blogs that contain a dash in the name.
johanneszab committed Oct 11, 2018
1 parent 5970aca commit ac148b3
Showing 1 changed file with 1 addition and 1 deletion.
@@ -215,7 +215,7 @@ private long ExtractNextPageLink(string document)
 // <a id="next_page_link" href="/liked/by/wallpaperfx/page/5/1457139681" class="next button chrome blue">Next</a></div></div>

 long unixTime = 0;
-var pagination = "(id=\"next_page_link\" href=\"[A-Za-z0-9_/:.]+/([0-9]+)/([A-Za-z0-9]+))\"";
+var pagination = "(id=\"next_page_link\" href=\"[A-Za-z0-9_/:.-]+/([0-9]+)/([A-Za-z0-9]+))\"";
 long.TryParse(Regex.Match(document, pagination).Groups[3].Value, out unixTime);
 return unixTime;
 }
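
The effect of the one-character change can be checked in isolation. The following is a minimal sketch, not the project's actual code: the class name `PaginationDemo`, the helper `ExtractUnixTime`, and the blog name `wallpaper-fx` are hypothetical and chosen only for illustration, while the two patterns are copied from the diff. It assumes the crawler sees markup shaped like the comment in the hunk.

```csharp
using System;
using System.Text.RegularExpressions;

public static class PaginationDemo
{
    // Pattern before the commit: '-' is not in the character class.
    public const string OldPattern =
        "(id=\"next_page_link\" href=\"[A-Za-z0-9_/:.]+/([0-9]+)/([A-Za-z0-9]+))\"";

    // Pattern after the commit: a trailing '-' in the class is a literal dash.
    public const string NewPattern =
        "(id=\"next_page_link\" href=\"[A-Za-z0-9_/:.-]+/([0-9]+)/([A-Za-z0-9]+))\"";

    // Hypothetical helper mirroring the logic of ExtractNextPageLink:
    // group 3 holds the trailing timestamp; a failed match yields an
    // empty string, so TryParse leaves the result at 0.
    public static long ExtractUnixTime(string document, string pattern)
    {
        long unixTime;
        long.TryParse(Regex.Match(document, pattern).Groups[3].Value, out unixTime);
        return unixTime;
    }

    public static void Main()
    {
        // Assumed markup for a blog whose name contains a dash.
        string withDash =
            "<a id=\"next_page_link\" href=\"/liked/by/wallpaper-fx/page/5/1457139681\" " +
            "class=\"next button chrome blue\">Next</a>";

        Console.WriteLine(ExtractUnixTime(withDash, OldPattern)); // prints 0
        Console.WriteLine(ExtractUnixTime(withDash, NewPattern)); // prints 1457139681
    }
}
```

With the old pattern the match fails at the dash, so the method returns 0, which the caller presumably treats as "no next page"; with the new pattern the timestamp is extracted as before.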