Issue with cloner #288

sanskar11 · 2021-04-06T19:00:52Z

When I was trying to clone my website http://researchweb.iiit.ac.in/~sanskar.tibrewal, there was issues with clonning, as it was cloning researchweb.iiit.ac.in instead of the link provided. On digging deep I foung the issue lies in the yarl.
For example:
`

a=yarl.URL("../index.html")
b=yarl.URL("http://researchweb.iiit.ac.in/~sanskar.tibrewal")
b.join(a)
Output:
URL('http://researchweb.iiit.ac.in/index.html')
`

Therefore the ~sanskar.tibrewal part get cut out.

Similar code can be found at:

Here self.root is the b and url is the a from the above example.

One of the fix is to rewrite the code and keep updating the self.root. Another fix is to use regular expressions or parsers for parsing out the dots and finding the correct links.

glaslos · 2021-06-12T12:25:06Z

@sanskar11 this was done to get the root of the website. You want a specific folder which wouldn't really work. We could move your sub-folder into the root of the website?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Issue with cloner #288

Issue with cloner #288

sanskar11 commented Apr 6, 2021

glaslos commented Jun 12, 2021

Issue with cloner #288

Issue with cloner #288

Comments

sanskar11 commented Apr 6, 2021

glaslos commented Jun 12, 2021