#nanogpt — Public Fediverse posts
Live and recent posts from across the Fediverse tagged #nanogpt, aggregated by home.social.
-
🎉 Behold, the #NanoGPT Slowrun: a marvel of 10x data efficiency that nobody asked for, achieved by throwing infinite compute at the problem like a toddler with a tantrum. 🤦♂️ Who knew the solution to our data bottleneck woes was to just ignore them and hope they go away with more compute power? 🚀 True #innovation at its finest!
https://qlabs.sh/10x #Slowrun #DataEfficiency #ComputePower #TechHumor #HackerNews #ngated
-
NanoGPT Slowrun: 10x Data Efficiency with Infinite Compute
#HackerNews #NanoGPT #Slowrun #DataEfficiency #InfiniteCompute #AI
-
Ah, the 2026 #visionaries have graced us with #NanoGPT Slowrun! 🚀 An open letter to the future, where our data droughts are hilariously overshadowed by compute floods. It's like trying to fill a swimming pool with a teaspoon and wondering why you're not Olympic-ready yet. 🏊♂️🤡
https://qlabs.sh/slowrun #Slowrun #DataDroughts #ComputeFloods #FutureTech #HackerNews #ngated
-
NanoGPT Slowrun: Language Modeling with Limited Data, Infinite Compute
#HackerNews #NanoGPT #Slowrun #LanguageModeling #LimitedData #InfiniteCompute
-
Train a 124M NanoGPT model from scratch in just 115 minutes with a 4090 graphics card and 1 billion Fineweb tokens! #NanoGPT #AI #MachineLearning #4090 #Fineweb #GPT2 #Training #Model #ArtificialIntelligence #DeepLearning #VietAI #MáyHọc #TríTuệNhânTạo
https://www.reddit.com/r/LocalLLaMA/comments/1ozre2i/nanogpt_124m_from_scratch_using_a_4090_and_a/
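As a quick sanity check on the numbers in that post (not code from the linked thread, just back-of-the-envelope arithmetic): 1 billion tokens in 115 minutes implies the sustained training throughput below.

```python
# Implied throughput for the claimed run: 1B Fineweb tokens in 115 minutes.
tokens = 1_000_000_000
minutes = 115
throughput = tokens / (minutes * 60)  # tokens processed per second
print(f"{throughput:,.0f} tokens/sec")  # roughly 145,000 tokens/sec
```

That is in the right ballpark for a 124M-parameter GPT-2-style model on a single 4090, which makes the claim plausible.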
-
Today I tried out #AMD #Instinct #MI300a for my existing deep learning pipeline. Good news: it worked out of the box. Bad news: for some reason it could not beat my local #Nvidia #1080ti...
After trying all sorts of #ROCM installation methods via prebuilt wheels, #apptainer images etc., I tried #nanogpt by @karpathy and sure enough: the GPT code ran approx 2x faster than on an #a100 ... I hope that this is due to my programming skills, not AMD preferring #transformers over #CNNs ...
-
Aaaaand the winner iiiis: 😊
1️⃣ How does ChatGPT work, actually? (2023-01)
A relatively easy-to-grasp explanation of how #ChatGPT works, yet still accurate at a high level. It starts with an explanation of ChatGPT's little cousin, #NanoGPT, and then describes the differences. Essentially it is a dual translation of #AndrejKarpathy's #NanoGPT video: format from video to text; target audience from programmers to the general public.
Enjoy!
https://netfuture.ch/2023/01/how-does-chatgpt-work-actually/