Skip to content

No way to obtain LicenseMatch origin #3608

@srehm

Description

@srehm

Description

I was scanning the subversion package from Debian Bullseye (1.14.1-3+deb11u1) with scancode 32.0.8. After the scan completed, I noticed bogus license detections in at least one file: build/ac-macros/swig.m4
Not only are there 51 (if I counted correctly :)) matches from which only the first one is correct. There are also matches in line ranges that don`t even exist in the file (e.g. the file has 360 lines and there are detections on lines 400+).

What makes this even stranger is the fact that in my tests I could not reproduce the error by scanning that file separately. It only happens if I scan the complete source tree. Almost as if the matches are pulled in from other files.

For your convenience I have attached both the sourcecode and the result json.

sourcecode.zip
result.zip

How To Reproduce

  • Download scancode v32.0.8 package for python 3.9 from the release page
  • Extract and configure
  • Download and extract the attached source code
  • Change directory to the extracted scancode package
  • Run: .\scancode.bat -cli --license-references --license-score 65 --strip-root -n 6 --verbose --json-pp result.json /path/to/extracted/source

System configuration

For bug reports, it really helps us to know:

  • What OS are you running on? Windows + Linux
  • What version of scancode-toolkit was used to generate the scan file? v32.0.8
  • What installation method was used to install/run scancode? Downloaded from release page

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions