Scaling Data-Constrained Language Models
Reproducible scaling laws for contrastive language-image learning (https://arxiv.org/abs/2212.07143)
[NeurIPS'24 Spotlight] Observational Scaling Laws
A toolkit for scaling law research ⚖
Dimensionless learning code for the paper "Data-driven discovery of dimensionless numbers and governing laws from scarce measurements".
Code for reproducing the experiments on large-scale pre-training and transfer learning for the paper "Effect of large-scale pre-training on full and few-shot transfer learning for natural and medical images" (https://arxiv.org/abs/2106.00116)
Official code for the paper, "Scaling Offline Model-Based RL via Jointly-Optimized World-Action Model Pretraining"
[ICML 2023] "Data Efficient Neural Scaling Law via Model Reusing" by Peihao Wang, Rameswar Panda, Zhangyang Wang
[NeurIPS 2023] Multi-fidelity hyperparameter optimization with deep power laws that achieves state-of-the-art results across diverse benchmarks.
Code for "Scaling Laws for Language Transfer Learning"
A method for calculating scaling laws for LLMs from publicly available models (see the generic power-law fitting sketch below)
The first dataset and benchmark for temporal graph foundation models
Code for the CoNLL BabyLM workshop paper "Mini Minds: Exploring Bebeshka and Zlata Baby Models"
🌹[ICML 2024] Selecting Large Language Model to Fine-tune via Rectified Scaling Law
A web calculator that uses scaling laws to estimate a model's training compute (FLOPs), cost, and energy use (see the back-of-the-envelope FLOPs sketch below).
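The compute figure such a calculator reports can be reproduced to first order with the widely used approximation C ≈ 6·N·D (total training FLOPs ≈ 6 × parameters × training tokens). The sketch below is not taken from any repository listed above; the peak throughput, utilization, and price values are illustrative assumptions.

```python
# Back-of-the-envelope training-compute estimate using the common
# C ≈ 6 * N * D approximation (N = parameters, D = training tokens).
# Peak throughput, MFU, and the GPU-hour price are illustrative
# assumptions, not values taken from any tool listed above.

def training_flops(n_params: float, n_tokens: float) -> float:
    """Approximate total training FLOPs for a dense transformer."""
    return 6.0 * n_params * n_tokens

def gpu_hours(total_flops: float,
              peak_flops_per_gpu: float = 312e12,  # A100-class BF16 peak
              mfu: float = 0.4) -> float:          # assumed model FLOPs utilization
    """Convert a FLOP count into GPU-hours at a given sustained throughput."""
    return total_flops / (peak_flops_per_gpu * mfu) / 3600.0

if __name__ == "__main__":
    c = training_flops(7e9, 2e12)   # e.g. a 7B-parameter model on 2T tokens
    hours = gpu_hours(c)
    cost = hours * 2.0              # hypothetical $2 per GPU-hour
    print(f"~{c:.2e} FLOPs, ~{hours:,.0f} GPU-hours, ~${cost:,.0f}")
```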
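Several of the entries above (the scaling-law toolkit and the method for estimating scaling laws from publicly available models, for example) revolve around fitting power laws to observed (model size, loss) pairs. The snippet below is a generic illustration of that kind of fit with `scipy.optimize.curve_fit`; the data points and the functional form L(N) = a·N^(−α) + c are assumptions for illustration, not the procedure of any particular repository listed here.

```python
import numpy as np
from scipy.optimize import curve_fit

def power_law(n, a, alpha, c):
    """Saturating power law: loss as a function of model size N."""
    return a * n ** (-alpha) + c

# Hypothetical (parameter count, validation loss) observations for illustration.
sizes = np.array([1.2e8, 3.5e8, 1.3e9, 2.7e9, 6.7e9, 1.3e10])
losses = np.array([3.95, 3.62, 3.25, 3.08, 2.91, 2.80])

# Nonlinear fit; an initial guess keeps the optimizer in a sensible region.
(a, alpha, c), _ = curve_fit(power_law, sizes, losses, p0=[100.0, 0.2, 2.0], maxfev=10000)
print(f"L(N) ≈ {a:.3g} * N^(-{alpha:.3f}) + {c:.3f}")

# Extrapolate the fitted curve to a larger model size.
print(f"Predicted loss at 70B parameters: {power_law(7e10, a, alpha, c):.3f}")
```

The fitted exponent α and the irreducible term c are the quantities scaling-law analyses typically report and extrapolate.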