This Week in Multimodal/GPT-3/OpenAI News (Links) PART II

Feb 09, 2022

More links for you, continuing my last email!

Memory-assisted prompt editing to improve GPT-3 after deployment abs: arxiv.org/abs/2201.06009

Deepak Pathak @pathak2206

LLMs like GPT-3 and Codex contain rich world knowledge. In this fun study, we ask if GPT like models can plan actions for embodied agents. Turns out, with apt sanity checks, even vanilla LLMs without any finetuning can generate good high-level plans given a low-level controller.

Bakz T. Future 🇨🇦 @bakztfuture

Don't say I didn't raise the alarm about Replika last year. They're a "unique agent" in the language model world, to say the least.

futurism.com: Men Are Creating AI Girlfriends and Then Verbally Abusing Them. A grisly trend has emerged: users who create AI partners, act abusive toward them, and post the toxic interactions online.

Mina Lee @MinaLee__

CoAuthor: Human-AI Collaborative Writing Dataset #CHI2022 👩‍🦰🤖 CoAuthor captures rich interactions between 63 writers and GPT-3 across 1445 writing sessions Paper & dataset (replay): coauthor.stanford.edu Joint work with @percyliang @fabulousQian 🙌

Each session starts with a prompt. Writers then freely write, request suggestions from GPT-3, accept or dismiss suggestions, and edit accepted suggestions or previous texts in any order they choose.

Language Models as Zero-Shot Planners: Extracting Actionable Knowledge for Embodied Agents abs: arxiv.org/abs/2201.07207 project page: wenlong.page/language-plann… LLMs such as GPT-3 and Codex can plan actions for embodied agents, even without any additional training

Bakz T. Future 🇨🇦 @bakztfuture

Breaking!! 👀👀 openai.com/blog/introduci…

openai.com: Introducing Text and Code Embeddings in the OpenAI API. We are introducing embeddings, a new endpoint in the OpenAI API that makes it easy to perform natural language and code tasks like semantic search, clustering, topic modeling, and classification. Embeddings are numerical representations of concepts converted to number sequences, which make it easy f…
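For readers new to the idea, here is a minimal sketch of how embedding-based semantic search works in principle. It does not call the OpenAI API: the three-dimensional vectors, the document names, and the helper functions `cosine_similarity` and `rank_documents` are all made up for illustration. Real embeddings have hundreds or thousands of dimensions and would come from an embeddings endpoint.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def rank_documents(query_vec, doc_vecs):
    """Return (name, score) pairs sorted from most to least similar."""
    scored = [
        (name, cosine_similarity(query_vec, vec))
        for name, vec in doc_vecs.items()
    ]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Hypothetical toy embeddings (hand-written, NOT produced by any model).
TOY_DOCS = {
    "feline care guide": [0.9, 0.1, 0.0],
    "stock market recap": [0.0, 0.2, 0.9],
    "dog training tips": [0.7, 0.3, 0.1],
}
# Hypothetical embedding for a query such as "how do I look after my cat?"
TOY_QUERY = [0.8, 0.2, 0.0]

ranking = rank_documents(TOY_QUERY, TOY_DOCS)

if __name__ == "__main__":
    for name, score in ranking:
        print(f"{score:.3f}  {name}")
```

The point of the sketch is that search becomes a nearest-neighbor lookup: documents whose vectors point in roughly the same direction as the query rank first, even when they share no keywords with it.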

Maithra Raghu @maithra_raghu

LaMDA: Language Models for Dialogue Applications Paper: arxiv.org/pdf/2201.08239… Blogpost: ai.googleblog.com/2022/01/lamda-… Excited to see this paper come out! I enjoyed the weddell seal conversation with LaMDA in our 2021 research summary blogpost!

Bakz T. Future 🇨🇦 @bakztfuture

We've trained GPT-3 to be more aligned with what humans want: The new InstructGPT models are better at following human intent than a 100x larger model, while also improving safety and truthfulness. https://t.co/rKNpCDAMb2

Bakz T. Future 🇨🇦 @bakztfuture

This is exactly why I spoke out so much against the name "prompt engineering" last year bakztfuture.substack.com/p/the-problem-…

Boris Power @BorisMPower

Our Instruct models are also a lot more intuitive and easier to use. No need for complex prompt engineering - now you can just specify what you'd like the model to do, in the same way as you would to a human! https://t.co/1GvV7DMPC8

Bakz T. Future 🇨🇦 @bakztfuture

Mantium announces some clever approaches to sharing and iterating on GPT-3/LLM prompts

Bakz T. Future 🇨🇦 @bakztfuture

Let me try again - doesn't InstructGPT represent a larger direction of a world where language models don't really need any prompt engineering? As a result, will prompt design be a thing of the past? Also, in which cases does it make sense to use GPT-3 DaVinci over InstructGPT?

Chain of Thought Prompting Elicits Reasoning in Large Language Models abs: arxiv.org/abs/2201.11903
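As a rough illustration of what the title describes, a chain-of-thought prompt shows the model a few worked examples whose answers spell out intermediate reasoning, then asks the new question in the same format. The sketch below only assembles the prompt text. The helper `build_cot_prompt`, the exemplar, and the "Q:/A:" layout are assumptions for illustration, not taken from the paper, and no model is called.

```python
def build_cot_prompt(exemplars, question):
    """Assemble a few-shot prompt whose examples include step-by-step reasoning."""
    parts = []
    for ex in exemplars:
        # Each exemplar shows the reasoning before stating the final answer.
        parts.append(
            f"Q: {ex['question']}\n"
            f"A: {ex['reasoning']} The answer is {ex['answer']}."
        )
    # The new question ends with an open "A:" for the model to continue.
    parts.append(f"Q: {question}\nA:")
    return "\n\n".join(parts)


# Hypothetical exemplar written for this sketch.
EXEMPLARS = [
    {
        "question": "A shelf holds 3 boxes with 4 books each. "
                    "2 more books are added. How many books are there?",
        "reasoning": "3 boxes of 4 books is 12 books. Adding 2 gives 14.",
        "answer": "14",
    }
]

prompt = build_cot_prompt(
    EXEMPLARS,
    "A tray has 5 rows of 6 cookies. 7 are eaten. How many are left?",
)

if __name__ == "__main__":
    print(prompt)
```

The resulting string would be sent to a language model as-is. The contrast with a standard few-shot prompt is only that the exemplar answers contain the reasoning rather than the bare result.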

Daniel Solis, birds suddenly appear @DanielSolis

These Birds Do Not Exist I trained an AI on public domain bird illustrations from old books. Ornithologists and birders, I'd LOVE it if you were able to still ID some of these weirdos. I'll share some of the "normal" results first, the ones that kinda sorta look like real birds.

Arvind Neelakantan @arvind_io

Zero-shot results of OpenAI API’s embeddings on the FIQA search dataset. Evaluation script: github.com/arvind-neural/… We zero-shot evaluated on 14 text search datasets, our embeddings outperform keyword search and previous dense embedding methods on 11 of them!

Arvind Neelakantan @arvind_io

In text search tasks, we obtain best zero-shot results in msmarco, triviaQA, and NQ and also the best transfer results on the BEIR benchmark. 5/7 https://t.co/ICO2fqtNgu
