Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Issue with cloner #288

Open
sanskar11 opened this issue Apr 6, 2021 · 1 comment
Open

Issue with cloner #288

sanskar11 opened this issue Apr 6, 2021 · 1 comment

Comments

@sanskar11
Copy link
Contributor

When I was trying to clone my website http://researchweb.iiit.ac.in/~sanskar.tibrewal, there was issues with clonning, as it was cloning researchweb.iiit.ac.in instead of the link provided. On digging deep I foung the issue lies in the yarl.
For example:
`

a=yarl.URL("../index.html")
b=yarl.URL("http://researchweb.iiit.ac.in/~sanskar.tibrewal")
b.join(a)
Output:
URL('http://researchweb.iiit.ac.in/index.html')
`

Therefore the ~sanskar.tibrewal part get cut out.

Similar code can be found at:

image

Here self.root is the b and url is the a from the above example.

One of the fix is to rewrite the code and keep updating the self.root. Another fix is to use regular expressions or parsers for parsing out the dots and finding the correct links.

@glaslos
Copy link
Member

glaslos commented Jun 12, 2021

@sanskar11 this was done to get the root of the website. You want a specific folder which wouldn't really work. We could move your sub-folder into the root of the website?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants