Github Copilot investigation
2022-10-18 18:33:08.613566+02 by
Dan Lyke
2 comments
This is fantastic: Github Copilot investigation:
If Microsoft and OpenAI chose to use these repos subject to their respective open-source licenses, Microsoft and OpenAI would’ve needed to publish a lot of attributions, because this is a minimal requirement of pretty much every open-source license. Yet no attributions are apparent.
Therefore, Microsoft and OpenAI must be relying on a fair-use argument. In fact we know this is so, because former GitHub CEO Nat Friedman claimed during the Copilot technical preview that “training [machine-learning] systems on public data is fair use”.
But a hell of a lot of that code is GPL or LGPL licensed and one can apparently recreate it with the right prompts...
[ related topics:
Interactive Drama Humor Microsoft moron Machinery Trains Joss Whedon - Serenity / Firefly
]
comments in ascending chronological order (reverse):
#Comment Re: Github Copilot investigation made: 2022-10-19 00:15:30.634253+02 by:
spc476
I wonder if Microsoft would change their stance if Windows source code started showing up via Copilot.
#Comment Re: Github Copilot investigation made: 2022-10-19 18:27:19.364279+02 by:
Dan Lyke
Yeah, I thought that the various Windows frameworks and source code would make a fantastic training set for
people developing for Windows...
We will not edit your comments. However, we may delete your
comments, or cause them to be hidden behind another link, if we feel
they detract from the conversation. Commercial plugs are fine,
if they are relevant to the conversation, and if you don't
try to pretend to be a consumer. Annoying endorsements will be deleted
if you're lucky, if you're not a whole bunch of people smarter and
more articulate than you will ridicule you, and we will leave
such ridicule in place.