Skip to content

Project Ideas Improve Go package license detection

Philippe Ombredanne edited this page Apr 11, 2021 · 4 revisions

Improve Go package license detection

The goal of this project is to improve Go package license detection across the board. scancode-toolkit's Go package detection is not great because Go has no structured metadata and therefore no declared_license. There are some limited license information in https://pkg.go.dev/ which is detected with a simple checker that we be able to scrape as this is not yet exposed in https://github.com/golang/go/issues/36785#issuecomment-773812872

This project would be a mix of adding new license detection rules to scancode, adding new and improved code to handle the specific patterns of license, creating new license mappings and possibly working with upstream maintainers to improve their license declarations. The approach should be to start with a complete data set of all package README and license files and to find patterns of license issues and establish the baseline, possibly with classifiers and ML. The end results should be a significant improvement to the license detection quality for the Go packages.

Clone this wiki locally