数据集:
codeparrot/github-jupyter
The dataset was extracted from Jupyter Notebooks on BigQuery.
Each example has the license of its associated repository. There are in total 15 licenses:
[ 'mit', 'apache-2.0', 'gpl-3.0', 'gpl-2.0', 'bsd-3-clause', 'agpl-3.0', 'lgpl-3.0', 'lgpl-2.1', 'bsd-2-clause', 'cc0-1.0', 'epl-1.0', 'mpl-2.0', 'unlicense', 'isc', 'artistic-2.0' ]