Am I the only one thinking that GPT3 and CoPilot can actually work once trained on properly licensed and properly audited code?
Well it will not be as ubiquitous as having all the github under your fingers, but perhaps is anyway better not to blindly cite the world's source code.
> Am I the only one thinking that GPT3 and CoPilot can actually work once trained on properly licensed and properly audited code?
Sadly you aren't. The truth is, however, that models like GPT3 and its derivatives like Codex/CoPilot are by design incapable of ever achieving this.
The only way to generate both correct and secure code is to use a combination of proper specs and theorem provers.
Even then this won't help with non-functional requirements, such as performance or platform-dependent resource constraints.
Generative models will always have the potential to yield broken code that doesn't do what you want or contains security flaws even if trained on "proper" code.
If I have to audit the code that CoPilot generates and if the code is as obfuscated as the Easter example, it's probably less useful than it says on the label...
Well it will not be as ubiquitous as having all the github under your fingers, but perhaps is anyway better not to blindly cite the world's source code.