Saw this microgpt.
200 Lines of Python, a 4192 parameter Generative Pre-trained Transformer.
There’s some good information here on this type of GPT model, which helps people understand the problems of building these LLM/GPT’s.
My thoughts were, could this be recoded into Clarion? Probably, we have a maths library so should be able to handle the statistical side of things.I think that would be a nice little side project in itself.
The training data would appear to be key.
The cloud data centres make the LLM’s like ChatGPT or Claude possible, but are simply too big and expensive to run using on-prem hardware, so a smaller version geared towards dedicated tasks beit generating clarion code or analysing customer data whilst providing privacy might be an alternative solution. Cue microgpt.
Its suggested this will be too small and will need to be scaled up.
Towards the end at the link, the section titled “Real stuff” covers much of what is required to make this a usable GPT, but this is where you create & train what you need not what OpenAi or Anthropic think is needed.
In some ways, you have the base to apply your own creative direction to create a specialised Ai for your own coding needs or your end user’s need’s.
In the FAQ’s, I particularly like this…
What’s the deal with “hallucinations”? The model generates tokens by sampling from a probability distribution. It has no concept of truth, it only knows what sequences are statistically plausible given the training data. microgpt “hallucinating” a name like “karia” is the same phenomenon as ChatGPT confidently stating a false fact. Both are plausible-sounding completions that happen not to be real. What’s the deal with “hallucinations”? The model generates tokens by sampling from a probability distribution. It has no concept of truth, it only knows what sequences are statistically plausible given the training data. microgpt “hallucinating” a name like “karia” is the same phenomenon as ChatGPT confidently stating a false fact. Both are plausible-sounding completions that happen not to be real.
What are other peoples thought on microGPT being used as a base to develop their own ?LLM? GPT Ai for their own needs?
Worth the effort?