Up next

Autoplay

Biden administration proposes "sustainable calm" in new language for Gaza ceasefire deal

00:00:27

Biden administration proposes "sustainable calm" in new language for Gaza ceasefire deal

Newsy · 33 Views · 01 Jul 2024

HISTORIC & VERY DANGEROUS CAT4 HURRICANE BERYL TAKES AIM AT ??? MODELS SHOW.....

00:14:59

HISTORIC & VERY DANGEROUS CAT4 HURRICANE BERYL TAKES AIM AT ??? MODELS SHOW.....

EEARTS · 1,441 Views · 30 Jun 2024

'We really saw President Biden in trouble': Body language expert on debate | Morning in America

00:04:10

'We really saw President Biden in trouble': Body language expert on debate | Morning in America

NewsNation · 1,224 Views · 28 Jun 2024

Analytic Philosophy Part 3: Language and Meaning

00:07:33

Analytic Philosophy Part 3: Language and Meaning

Professor Dave Explains · 3,035 Views · 26 Jun 2024

WSHH Presents "Down In the DM's" Hosted by DamnHomie - OnlyFans Models Read Their Wildest DMs! Ep. 6

00:08:01

WSHH Presents "Down In the DM's" Hosted by DamnHomie - OnlyFans Models Read Their Wildest DMs! Ep. 6

WORLDSTARHIPHOP · 5,714 Views · 26 Jun 2024

The Importance Of Language To The Abortion Debate

00:02:55

The Importance Of Language To The Abortion Debate

Dinesh D'Souza · 162 Views · 26 Jun 2024

President Trump's Spiritual Adviser Paula White - "Saka Tara" - Scream/Foreign Language/Alien Talk?

00:00:11

President Trump's Spiritual Adviser Paula White - "Saka Tara" - Scream/Foreign Language/Alien Talk?

888Davey888 · 3,002 Views · 08 Nov 2019

Live CEOing Ep 816: Language Design in Wolfram Language [Tabular]

01:00:29

Live CEOing Ep 816: Language Design in Wolfram Language [Tabular]

Wolfram · 381 Views · 21 Jun 2024

Ted Bundy and Paul Bernardo: Similarities in Language and Psychology

00:24:12

Ted Bundy and Paul Bernardo: Similarities in Language and Psychology

Martin DeCoder · 24,580 Views · 22 Aug 2022

Q2B23 SV | Quantum Generative Models of Financial Time Series | Vanio Markov & Vladimir Rastunkov

00:22:25

Q2B23 SV | Quantum Generative Models of Financial Time Series | Vanio Markov & Vladimir Rastunkov

QC Ware · 98 Views · 30 Jan 2024

00:09:12

CTMU, MADE SIMPLE: Reality = Language

CTMU SINGULARITY · 711 Views · 08 Apr 2023

The Fascinating History of Sign Language

00:15:03

The Fascinating History of Sign Language

PowerfulJRE · 304,828 Views · 31 Jan 2024

Moshe Kasher on Raves and Sign Language + Stunt Driver Robert Nagle on The Biscuit Rig

01:34:51

Moshe Kasher on Raves and Sign Language + Stunt Driver Robert Nagle on The Biscuit Rig

Adam Carolla · 14,143 Views · 29 Jan 2024

Easy MEGA Guide to LLMs in 2024 (Large Language Models) Get Into AI!

00:30:01

Easy MEGA Guide to LLMs in 2024 (Large Language Models) Get Into AI!

MattVidPro AI · 4,278 Views · 24 Jan 2024

Timcast IRL - Sports Illustrated FIRES MOST Staff, Trans Models & AI Scandal BREAK Company w/ALX

02:06:30

Timcast IRL - Sports Illustrated FIRES MOST Staff, Trans Models & AI Scandal BREAK Company w/ALX

Timcast IRL · 197,413 Views · 20 Jan 2024

What Goes Into Training AI Language Models?

00:00:21

What Goes Into Training AI Language Models?

Eye on AI · 419 Views · 19 Jan 2024

00:05:32

Introduction to the Latin Language

Professor Dave Explains · 2,717 Views · 19 Jan 2024

Dean Phillips CHANGES DEI Language After $1M Bill Ackman Donation

00:12:49

Dean Phillips CHANGES DEI Language After $1M Bill Ackman Donation

Due Dissidence · 758 Views · 19 Jan 2024

Eric Volz Praises IHOPKC Leader David Sliker Despite LEAKED Audio With Colorful Language 6 Jan, 2024

00:15:02

Eric Volz Praises IHOPKC Leader David Sliker Despite LEAKED Audio With Colorful Language 6 Jan, 2024

The Conservative Truth · 236 Views · 06 Jan 2024

What’s Your Leadership Language? | Rosita Najmi | TED

00:08:17

What’s Your Leadership Language? | Rosita Najmi | TED

TED · 15,397 Views · 04 Jan 2024

The Learners Fund - The Khan Academy story

00:06:39

The Learners Fund - The Khan Academy story

Khan Academy · 1,941 Views · 26 Dec 2023

Understanding and Mitigating Copying in Diffusion Models

00:57:03

Understanding and Mitigating Copying in Diffusion Models

Google TechTalks · 152 Views · 04 Dec 2023

Fluffy Cloud Seeding News report Rant *Language

00:06:25

Fluffy Cloud Seeding News report Rant *Language

SkyNomalies - I'm Still Standing Too · 9 Views · 01 Dec 2023

New Manifesto BOMBSHELL As Louisville Monster MATCHES Language Of Nashville Monster!

00:12:01

New Manifesto BOMBSHELL As Louisville Monster MATCHES Language Of Nashville Monster!

TheQuartering · 22,095 Views · 23 Nov 2023

Live CEOing Ep 761: Language Design in the Wolfram Language [LinkObject, Messages, and More]

01:09:43

Live CEOing Ep 761: Language Design in the Wolfram Language [LinkObject, Messages, and More]

Wolfram · 398 Views · 22 Nov 2023

Tim Minchin On Offensive Language | So F***ing Rock | Universal Comedy

00:08:47

Tim Minchin On Offensive Language | So F***ing Rock | Universal Comedy

Universal Comedy · 2,472 Views · 17 Nov 2023

Do Models actually do this? w/ John Watters #Podcast #Shorts

00:00:46

Do Models actually do this? w/ John Watters #Podcast #Shorts

Club Random Podcast · 6 Views · 09 Nov 2023

Decoding Language Model Pre-training Datasets

00:00:42

Decoding Language Model Pre-training Datasets

Eye on AI · 21 Views · 09 Nov 2023

Body language expert says DeSantis' head movements make him look weak

00:02:10

Body language expert says DeSantis' head movements make him look weak

Newsmax TV · 5,922 Views · 09 Nov 2023

Hypocrites abound lol morning rant *language

00:01:46

Hypocrites abound lol morning rant *language

SkyNomalies - I'm Still Standing Too · 4 Views · 08 Nov 2023

5 Electric Vehicle Models to Watch in the UK

00:00:57

5 Electric Vehicle Models to Watch in the UK

Bloomberg Quicktake: Now · 801 Views · 05 Nov 2023

Woman on Plane FINALLY Speaks to TMZ! What did Tiffany Gomas See?! Body Language Analyst Reacts!

00:33:35

Woman on Plane FINALLY Speaks to TMZ! What did Tiffany Gomas See?! Body Language Analyst Reacts!

The Behavioral Arts · 281,363 Views · 02 Sep 2023

Yann LeCun on World Models, AI Threats and Open-Sourcing | Eye On AI #150

00:55:37

Yann LeCun on World Models, AI Threats and Open-Sourcing | Eye On AI #150

Eye on AI · 1,221 Views · 02 Nov 2023

Creating the Language of the Pendragon Cycle

00:00:56

Creating the Language of the Pendragon Cycle

The Daily Wire · 1,307 Views · 02 Nov 2023

Navigating the Language of AI & Large Language Models | Scott Downes | Eye on AI #132

01:04:00

Navigating the Language of AI & Large Language Models | Scott Downes | Eye on AI #132

Eye on AI · 1,622 Views · 02 Aug 2023

The Future of Large Language Models in AI | Mathew Lodge | Eye on AI #130

00:49:44

The Future of Large Language Models in AI | Mathew Lodge | Eye on AI #130

Eye on AI · 6,024 Views · 19 Jul 2023

How AI Language Models Will Shape The Future | Aidan Gomez | Eye on AI #123

01:02:05

How AI Language Models Will Shape The Future | Aidan Gomez | Eye on AI #123

Eye on AI · 25,764 Views · 24 May 2023

How can learning technology help create better experiences for learners?

00:37:50

How can learning technology help create better experiences for learners?

Ufi VocTech Trust · 21 Views · 09 Jan 2023

Sean Lock on page 3 models, bad TV and losing his keys | Lockipedia | Universal Comedy

00:08:14

Sean Lock on page 3 models, bad TV and losing his keys | Lockipedia | Universal Comedy

Universal Comedy · 262 Views · 24 Oct 2023

PRODUCER DR PERIOD RECALLS MAKING HIT RECORD BROKEN LANGUAGE

00:00:53

PRODUCER DR PERIOD RECALLS MAKING HIT RECORD BROKEN LANGUAGE

MATH HOFFA · 4,282 Views · 24 Oct 2023

AI Debates, Reinforcement Learning, & The Power of Generative Models | Yilun Du | Eye on AI #147

00:55:05

AI Debates, Reinforcement Learning, & The Power of Generative Models | Yilun Du | Eye on AI #147

Eye on AI · 581 Views · 22 Oct 2023

Socio-PLT: Quantitative and Social Theories for Programming Language Adoption

00:57:53

Socio-PLT: Quantitative and Social Theories for Programming Language Adoption

Google TechTalks · 4,367 Views · 21 Nov 2012

Improved Feature Importance Computation for Tree Models Based on the Banzhaf Value

00:53:11

Improved Feature Importance Computation for Tree Models Based on the Banzhaf Value

Google TechTalks · 587 Views · 07 Apr 2023

Jon Zherka Exposes OF Models Top Donators..

00:22:35

Jon Zherka Exposes OF Models Top Donators..

Zherka Live · 7,990 Views · 10 Oct 2023

Max Tegmark: Language Models Understand Time and Space

00:11:25

Max Tegmark: Language Models Understand Time and Space

4IR with David Shapiro · 8,464 Views · 06 Oct 2023

00:00:36

The #1 Method To IMPROVE AI Models

Eye on AI · 129 Views · 05 Oct 2023

Live CEOing Ep 754: Language Design Review of Special Project Features for 14.0 continued

00:50:31

Live CEOing Ep 754: Language Design Review of Special Project Features for 14.0 continued

Wolfram · 245 Views · 04 Oct 2023

Live CEOing Ep 750: Language Design Review of GeometricSolveValues and GeometricScene

00:54:04

Live CEOing Ep 750: Language Design Review of GeometricSolveValues and GeometricScene

Wolfram · 411 Views · 30 Sep 2023

Do They Add Up? Using Macro Counterfactuals to Assess Micro Estimates and Macro Models

01:33:13

Do They Add Up? Using Macro Counterfactuals to Assess Micro Estimates and Macro Models

Hoover Institution · 1,765 Views · 30 Sep 2023

GPT-4 Vision is Extremely Capable, NEW AI Video Models, Open Source Voice Cloning | AI NEWS

00:25:22

GPT-4 Vision is Extremely Capable, NEW AI Video Models, Open Source Voice Cloning | AI NEWS

MattVidPro AI · 1,025 Views · 30 Sep 2023

GPT-2: Language Models are Unsupervised Multitask Learners

15,061 Views

Original

Yannic Kilcher

Published on 18 Feb 2019 / In News & Politics

A look at OpenAI's new GPT-2 model and the surrounding controversy. https://blog.openai.com/better-language-models/ Abstract: Natural language processing tasks, such as question answering, machine translation, reading comprehension, and summarization, are typically approached with supervised learning on taskspecific datasets. We demonstrate that language models begin to learn these tasks without any explicit supervision when trained on a new dataset of millions of webpages called WebText. When conditioned on a document plus questions, the answers generated by the language model reach 55 F1 on the CoQA dataset - matching or exceeding the performance of 3 out of 4 baseline systems without using the 127,000+ training examples. The capacity of the language model is essential to the success of zero-shot task transfer and increasing it improves performance in a log-linear fashion across tasks. Our largest model, GPT-2, is a 1.5B parameter Transformer that achieves state of the art results on 7 out of 8 tested language modeling datasets in a zero-shot setting but still underfits WebText. Samples from the model reflect these improvements and contain coherent paragraphs of text. These findings suggest a promising path towards building language processing systems which learn to perform tasks from their naturally occurring demonstrations. Authors: Alec Radford, Jeffrey Wu, Rewon Child, David Luan, Dario Amodei, Ilya Sutskever

0 Comments Sort By