/g/ - Technology


Thread archived.
You cannot reply anymore.




File: 1752672500923401.png (647 KB, 2360x2824)
It's common knowledge that ALL LLMs were trained on publicly accessible repositories, which are licensed under either the GPL or the MIT/BSD/Apache licenses. Unless AI companies are explicitly breaking the law, they can't train their models on proprietary code.
Thus, it's only fair to force all LLM code providers to license generated code under GPL/AGPL, since it's impossible to tell whether the training data behind any given piece of generated code was licensed under GPL or BSD.
>>
File: prop.png (245 KB, 1080x877)
>>108637623
AI models should be open source.
>>
>>108637670
They are already open source, since they're trained on open-source data. You already have access to all the data they were trained on.
What you probably meant is that they must be publicly available, like Deepseek? Well, that would be great, but idk what you're going to do with a 600B model, it's not like you can self-host it.
>>
>>108637688
>idk what you're going to do with a 600B model, it's not like you can self-host it.
I will simply wait until it finishes processing a response.
>>
spoiler: they are 100% training on proprietary code
the data sets are not sanitized based on licensing or anything like that
if the code is source available, it goes in the model
so yes, the government should 100% seize these companies and make them publicly owned, because they stole everyone’s shit, and should not be allowed to profit from it
>>
>>108637688
If the weights were available, the model could be trained further & also audited for backdoors/biases/censorship.
Obviously that would cost a lot of money, but that's beside the point.
>>
>>108637792
>spoiler: they are 100% training on proprietary code
They only train on proprietary code that is available to them. Most proprietary code resides in private self-hosted repositories on corporate servers that you can't reach from the WAN.
>>
>>108637623
AI companies don't need to obey laws.
>>
>>108637623
>outsourcing your brain is...LE GOOD


