An LLM cookbook for building your own from scratch, all the way from gathering data to training a model
This repository features a custom-built decoder-only language model (LLM) with 37 million parameters 🔥. The model is trained to generate questions from a given context.
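As a rough sketch of what a small decoder-only model like this looks like, here is a minimal PyTorch version; the class name and all hyperparameters are illustrative assumptions, not taken from the repository:

```python
import torch
import torch.nn as nn

class DecoderOnlyLM(nn.Module):
    """Minimal GPT-style decoder-only language model (illustrative sizes)."""
    def __init__(self, vocab_size=32000, d_model=512, n_heads=8, n_layers=6, max_len=1024):
        super().__init__()
        self.tok_emb = nn.Embedding(vocab_size, d_model)
        self.pos_emb = nn.Embedding(max_len, d_model)
        block = nn.TransformerEncoderLayer(d_model, n_heads, 4 * d_model,
                                           batch_first=True, norm_first=True)
        self.blocks = nn.TransformerEncoder(block, n_layers)
        self.lm_head = nn.Linear(d_model, vocab_size, bias=False)

    def forward(self, idx):
        seq_len = idx.size(1)
        x = self.tok_emb(idx) + self.pos_emb(torch.arange(seq_len, device=idx.device))
        # Causal mask: each position may only attend to itself and earlier tokens.
        mask = torch.triu(torch.full((seq_len, seq_len), float("-inf"),
                                     device=idx.device), diagonal=1)
        return self.lm_head(self.blocks(x, mask=mask))  # next-token logits
```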
Experimental project for AI and NLP based on the Transformer architecture
Implementation of the GPT-3 paper: Language Models are Few-Shot Learners
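Few-shot learning in the GPT-3 sense happens entirely at the prompt level: task demonstrations are placed in the context and the model completes the pattern, with no weight updates. A sketch using the paper's own translation example:

```python
# Few-shot prompting: the "training" happens entirely in the context window.
prompt = """Translate English to French.

sea otter => loutre de mer
peppermint => menthe poivrée
cheese =>"""
# Feeding `prompt` to an autoregressive LM should yield "fromage".
```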
Generate captions for images using a CNN-encoder / LSTM-decoder structure
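For the CNN-encoder / LSTM-decoder pattern described in that entry, a minimal PyTorch sketch might look like this; the ResNet-18 backbone and all sizes are assumptions for illustration:

```python
import torch
import torch.nn as nn
from torchvision import models

class CaptionModel(nn.Module):
    """CNN encoder + LSTM decoder for image captioning (illustrative sizes)."""
    def __init__(self, vocab_size=10000, embed_dim=256, hidden_dim=512):
        super().__init__()
        cnn = models.resnet18(weights=None)  # in practice, load pretrained weights
        self.encoder = nn.Sequential(*list(cnn.children())[:-1])  # drop the classifier
        self.img_proj = nn.Linear(cnn.fc.in_features, embed_dim)
        self.embed = nn.Embedding(vocab_size, embed_dim)
        self.lstm = nn.LSTM(embed_dim, hidden_dim, batch_first=True)
        self.out = nn.Linear(hidden_dim, vocab_size)

    def forward(self, images, captions):
        feats = self.encoder(images).flatten(1)        # (B, 512) image features
        feats = self.img_proj(feats).unsqueeze(1)      # (B, 1, E) as a "start token"
        tokens = self.embed(captions)                  # (B, T, E) caption embeddings
        hidden, _ = self.lstm(torch.cat([feats, tokens], dim=1))
        return self.out(hidden)                        # per-step vocabulary logits
```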
Transformers Intuition
Implement a basic decoder-only Transformer model from scratch, then upgrade it step by step to build your own LLM
Generative AI fine-tuning and inference for sequence classification tasks
An explainable and simplified version of OLMo model
DNA sequence generation/classification using Transformers
This project aims to simplify texts from research papers using advanced natural language processing (NLP) techniques, making them more accessible to a broader audience
An LLM-based tool for generating cheese advertisements
Code and dataset used to train dialect adapters for decoder models.
Custom decoder Transformer that treats a patient's medical journey like a story told through diagnosis codes instead of words.
Using LLMs from Hugging Face for sentiment analysis, translation, summarization, and extractive question answering
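The Hugging Face transformers pipeline API makes tasks like these one-liners; a small sketch (each pipeline downloads a default checkpoint unless a model is specified):

```python
from transformers import pipeline

sentiment = pipeline("sentiment-analysis")
summarize = pipeline("summarization")
qa = pipeline("question-answering")

print(sentiment("Decoder-only models are remarkably versatile."))
print(summarize("Some long article text ... " * 20, max_length=60, min_length=10))
print(qa(question="What does a decoder model do?",
         context="A decoder model generates text autoregressively, one token at a time."))
```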
Decoder model for language modelling
Coding A Decoder Only Transformer Like ChatGPT From Scratch
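The core of any such from-scratch decoder is causal (masked) self-attention; a single head written out by hand, as a sketch:

```python
import torch
import torch.nn.functional as F

def causal_self_attention(x, w_q, w_k, w_v):
    """Single-head causal self-attention, written out by hand.

    x: (batch, seq_len, d_model); w_q, w_k, w_v: (d_model, d_head) weights.
    """
    q, k, v = x @ w_q, x @ w_k, x @ w_v
    scores = q @ k.transpose(-2, -1) / (k.size(-1) ** 0.5)
    # Mask out future positions so generation stays strictly left-to-right.
    seq_len = x.size(1)
    future = torch.triu(torch.ones(seq_len, seq_len, dtype=torch.bool,
                                   device=x.device), diagonal=1)
    scores = scores.masked_fill(future, float("-inf"))
    return F.softmax(scores, dim=-1) @ v  # (batch, seq_len, d_head)
```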
Intent Detection API using BERT and Flask
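A minimal Flask endpoint wrapping a BERT classifier could look like the sketch below; `bert-base-uncased` is a stand-in, since a real intent API would load a checkpoint fine-tuned on intent labels:

```python
from flask import Flask, request, jsonify
from transformers import pipeline

app = Flask(__name__)
# Stand-in checkpoint: substitute a BERT model fine-tuned for intent detection.
classifier = pipeline("text-classification", model="bert-base-uncased")

@app.route("/intent", methods=["POST"])
def detect_intent():
    text = request.get_json()["text"]
    return jsonify(classifier(text)[0])  # e.g. {"label": "...", "score": 0.97}

if __name__ == "__main__":
    app.run(port=5000)
```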
On the Design and Performance of Machine Learning Based Error Correcting Decoders
A mini version of GPT trained on Shakespeare using BPE tokenization
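Byte-pair encoding itself is simple: repeatedly merge the most frequent adjacent symbol pair. A toy pure-Python sketch of the training loop (the corpus and merge count are made up):

```python
from collections import Counter

def most_frequent_pair(words):
    """Count adjacent symbol pairs across the corpus and return the commonest."""
    pairs = Counter()
    for word, freq in words.items():
        for a, b in zip(word, word[1:]):
            pairs[(a, b)] += freq
    return max(pairs, key=pairs.get)

def merge_pair(words, pair):
    """Rewrite every word, fusing each occurrence of `pair` into one symbol."""
    merged = {}
    for word, freq in words.items():
        out, i = [], 0
        while i < len(word):
            if i + 1 < len(word) and (word[i], word[i + 1]) == pair:
                out.append(word[i] + word[i + 1])
                i += 2
            else:
                out.append(word[i])
                i += 1
        merged[tuple(out)] = merged.get(tuple(out), 0) + freq
    return merged

# Toy corpus: word -> frequency, words pre-split into characters.
words = {tuple("low"): 5, tuple("lower"): 2, tuple("lowest"): 3}
for _ in range(3):                      # learn three merges
    pair = most_frequent_pair(words)
    print("merging", pair)
    words = merge_pair(words, pair)
```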