Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Use Hadoop Filesystem Path URI instead of java.net.URI #67

Merged
merged 1 commit into from
Feb 5, 2020

Conversation

morazow
Copy link
Contributor

@morazow morazow commented Feb 5, 2020

The java.net.URI cannot resolve the paths that contain regex patterns,
for example, 's3a://bucket/{year=2019/month=1,year=2019/month=2}/*'.

Instead we should use Hadoop Filesystem Path, and then convert the path
to URI using .toUri() method.

For example, to obtain the schema from a path:

new Path(bucketPathString).toUri.getSchema

The `java.net.URI` cannot resolve the paths that contain regex patterns,
for example, `'s3a://bucket/{year=2019/month=1,year=2019/month=2}/*'`.

Instead we should use Hadoop Filesystem Path, and then convert the path
to URI using `.toUri()` method.

For example, to obtain the schema from a path:

```scala
new Path(bucketPathString).toUri.getSchema
```
@morazow
Copy link
Contributor Author

morazow commented Feb 5, 2020

Fixes #65

@morazow morazow merged commit 451d30c into master Feb 5, 2020
@morazow morazow deleted the bug/#65-fix-regex-path-matching-error branch June 21, 2020 17:32
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant