Repository for My HuggingFace Natural Language Processing Projects
-
Updated
Aug 31, 2023 - Jupyter Notebook
Repository for My HuggingFace Natural Language Processing Projects
Auto-regressive causal language model for molecule (SMILES) and reaction template (SMARTS) generation based on the Hugging Face implementation of OpenAI's GPT-2 transformer decoder model
Codebase for arXiv:2405.17767, based on GPT-Neo and TinyStories.
Transformers Intuition
A multi-threaded GitHub scraper to collect Python code with docstrings from public repositories, creating a well-documented dataset for the JaraConverse LLM model.
Causal language modeling and intent classification using GPT-2.
A quick and easy way to interact with open-source LLMs.
Course materials for the Machine Learning for NLP course taught by Sameer Singh for the Cognitive Science summer school 2022.
Links to my repositories, where I implement a wide variety of Natural Language Processing models using TensorFlow and Hugging Face.
Dataset and model fine-tuning for function calling
An AI generated picturebook.
Fine-tuning (or training from scratch) the library models for language modeling on a text dataset for GPT, GPT-2, ALBERT, BERT, DitilBERT, RoBERTa, XLNet... GPT and GPT-2 are trained or fine-tuned using a causal language modeling (CLM) loss while ALBERT, BERT, DistilBERT and RoBERTa are trained or fine-tuned using a masked language modeling (MLM…
Rescoring Automatic Speech Recognition using Large Language Models
This repository is for the paper Lexical Substitution as Causal Language Modeling. In Proceedings of the 13th Joint Conference on Lexical and Computational Semantics (*SEM 2024), Mexico City, Mexico. Association for Computational Linguistics.
This is the implementation of low rank adaptation (LoRA) which is a subset of parameter efficient fine tuning (PEFT).
Add a description, image, and links to the causal-language-modeling topic page so that developers can more easily learn about it.
To associate your repository with the causal-language-modeling topic, visit your repo's landing page and select "manage topics."