So, few days ago a new technology entered in industry github copilot.
Github copilot is basically a code suggestion software like other software ...
For further actions, you may consider blocking this person and/or reporting abuse
I don't really see how microsoft selling access to an AI assistant on their server impacts open source.
You write code for free. They use it to train their AI and potentially you are the buyer of your own code.
If you want everyone to have access to your code, "everyone" includes people you don't like.
If the license says they can copy and redistribute it then that's exactly what they can do - and that's your choice as a software author.
You're perfectly free to make a license that says your code can only be used by people or companies with less than $1,000,000 turnover if you want, but bigness and richness don't always correlate well with any one person's idea of "morally good".
My opinion is that every developer should be able to choose whether his code can be used in AI or not. And Microsoft or any other company that offers tool similar to copilot should respect that and other licenses
Yes, that would be great
You hire a person, you train them on your code and potentially you are paying the wages of someone reproducing your code.
That is the problem, we have to pay for our code
But you're not paying for your code. You're paying for the resources to run an AI assistant that has gleamed understanding from looking at lots of code examples.
Andrew, there are people like you and me who write code in our own repository on github, microsoft is giving those code for copilot training for their profit.
What about companies that sell products that use React? That's open source and the companies make profit.
Engineer Man had a good video about GitHub Copilot:
youtube.com/watch?v=b9u3ZAGQmT0
thanks, I will check that .
When I open-source my code, it is free for any purpose under an MIT license. I am specifically allowing anyone to use it for anything, I guess that includes Microsoft distributing it to others, if they benefit, even if they had to pay to find it, I'm still happy with that.
The problem starts when autopilot takes code from projects it shouldn't be taking code from because of licensing reasons.
Stuff like that has the potential of causing major legal problems for both github and anyone using auto- I mean, copilot, if it accidentally ends up reproducing code that originally used a license that doesn't permit redistributing under any random license.
That's certainly a good reason not to use copilot :)
So long as copilot does not use private repos for this, I see no issue with this. If your repo is public, regardless what the license is, you're exposing your code and your knowledge to others, and you MUST assume the possibility that, even though not legal in some situations, people will copy and distribute software using your code as a basis. End of story
Open source is too wide to use here. There are many OS licences. In this case, there is OS that allows anybody to do anything with the code (including selling it), and there are licences that do not allow re-selling of the code or proprietary derivative works.
I don't think GitHub would break those licences. So I believe it's just a matter of choosing a right licence for your code.
I do not see a problem with Microsoft potentially monetizing a service that has been trained on open source source code, as long as they do not break any licensing and only use public repositories as training data.
There are plenty of examples of services providing added value on top of other services or data, commercial or non-commercial.
Pick any website that provides some comparison of services and products for you to find the "best deal" for whatever it is, for example.
If licence for source code says it is free to use for commercial and non-commercial use, then that is perfectly fine. Your best bet to potentially avoid Microsoft using your code, use a copyleft license, such as GPL. It will not stop commercial use entirely, but may potentially reduce commercial utilization.
One interesting aspect here though, which I do not have an answer to, is if these various licenses cover using code as data, as opposed to code as code (that is executed). I think they probably do that, but I am not sure.
Microsoft paid $7.5 billion (in stock) for Github. No doubt Github Copilot and many other services were part of a plan to get that investment back and then some.
Technically, others could build the same thing, with public repositories. Not as likely to happen though, without a bunch of compute resources and/or money available.
I don’t think it impacts open source software anyhow. Though I have some anxiety about robots freely accessing the huge code archive that GitHub actually is 😅
Basically, We are paying for our own code. There should be provision of separate license that will prohibit company to train their bot using our code unless we give permission.