Improve SLM coding capabilities

Témavezető: Virág Fausztin Asztrik
Lain Consulting Kft.
email: fuszti@gmail.com

Projekt leírás

The Nvidia's recent paper encourage the use of small language models (SLM) in agentic frameworks. There are many good agentic frameworks for programming, like Opencode. The good part of the programming tasks is that they provide environments, that can be tested automated, like is the code running or not. (This is the core idea of the AlphaEvolve RL algorithm.) Of course the correctness of the generated code can be challenging. This project's goal is exploring fine-tuning possibilities to get specialized SLM-based AI agents that can write a specific code family very well. An interesting example code family can be music generator python codes, for example the musicpy is a python library that can generate musics. But it is just a suggestion, we can specify other use-cases.

Előfeltételek

Advenced programming skills Experiences with neural networks. Basic knowledge of reinforcement learning.

Hivatkozások

https://arxiv.org/abs/2506.02153 https://arxiv.org/abs/2402.03300 https://arxiv.org/abs/2506.13131 https://github.com/sst/opencode https://github.com/Rainbow-Dreamer/musicpy