X/Twitter Archive for @sergeykarayev

GitHub
RT @sergeykarayev: How we do front-end dev now:

1. Describe the feature
2. Launch Claude Code, Codex, Gemini, Amp, and Opencode to compete…
@DanielleFong I never got to experience this bygone golden age
@giffmana Agree, I’m on the Claude train pretty solidly
TIL you can still use GPT 4.5 if you are on the Pro plan and enable "Show additional models" 😍x.com/giffmana/statu…Fx
Media from tweet 2007525021296812515
@matthayes13 Just to clarify, are you in the mobile app? Try to sign up from the actual safari browser, not the mobile app, first. Then you’ll be able to log in with mobile app afterward. (Apple Store thing that we’re trying to work around)
@matthayes13 Just to clarify, are you in the mobile app? Try to dig up from the actual safari browser Firefox not the mobile app.
@matthayes13 Yeah feel free to to sign in with google even if you don’t have account, it’ll automatically sign you up!
@DamonCrockett Another perspective is that the universe “built” all of bacteria, humans, and now AIs.
@jheitzeb Dangerously skip permissions with stable infrastructure
my new year's resolution is to dangerously skip permissions
Need to update the website, but it's finally sunny and I want to hike 🤷🏻‍♂️

Launched agents, went on a hike, checked in on my phone, tweaked code, and opened a PR from the trail6N
Media from tweet 2006107201186996535
We tried many ways to give Claude Code the right context. The one that worked best was actually the one that took the least effort.
Media from tweet 2005742474292310101
@bryan_johnson Bryan I know you have and value kids but this is a profoundly childless admonition. How do you reconcile TFR vs RHR?
i did not know that having a toddler would entail my apple notes getting filled with insane toddler notes
Media from tweet 2003644356889313644
Some are misunderstanding what I was trying to say.

This is an example of a robot doing a task that only humans could do before. But now the robot can do it 1000x faster, and that feels inhuman.

When we have true AGI, a robot will be able to do ~every task that only humans…
@haltakov I mean, LLMs already read and write 1000x faster than me
@yishan So that’s how you spot an AI bot huh
@Coscorrodrift This thing is not AI necessarily but it’s a robot doing a thing that a human can do, but 1000x faster.

Once AGI exists they can do anything a human can do, we should remember that it’s also going to be able to do those things 1000x faster.
This robot solving a rubiks cube in 0.103 seconds is a little preview of what "AGI" really means
Media from tweet 2001722424950432108
@tannerlinsley This is how we like to do it (and not only claude code)
Media from tweet 2001715583654695328
@signulll AKA which race are you picking in the RPG character creation screen
Media from tweet 2000977850602754273
This just isn't right.
Media from tweet 2000740709754667278
My current LLM daily drivers

Coding: Opus 4.5 via Claude Code

Random reasoning tasks (eg review these legal docs, make a table of activities from these links, random questions): Opus 4.5 via Claude app

Image generation: Gemini 3 via web

Voice chat: Grok app
@Architect9000 and then you'll just time travel back to the past?
How we do front-end dev now:

1. Describe the feature
2. Launch Claude Code, Codex, Gemini, Amp, and Opencode to compete on designs
3. ...
4. Profit

Each agent can use the live app preview (@SuperconductDev supports any env, including docker-compose) to test its own work and…
Media from tweet 1999679529162588223
@jheitzeb Amount of beachfront real estate
annual reminder
Media from tweet 1999546915713876158
Are there any websites with this aesthetic?
Media from tweet 1999218639191634402
@seekingtroooth I’d say I’m a fair bit more Sergey than Claude but I’m a non-zero amount of Claude also I agree
A surprising number of my chats with Claude end like this
Media from tweet 1999179955289358409
Post-AGI, emotional labor will be all that’s left.
Is it crazy that the best context we've found for coding agents is just... Slack threads?

No RAG. Nothing fancy. Just being able to tag the agent in the thread where people already explained the problem, shared screenshots, and debated the fix.

Anyone else find the same thing?
“Real median income” is going up and up, according to official charts — but people feel poorer and poorer.

But what if instead of using CPI to convert nominal income to real income, we divided nominal income by the price of an S&P5 00 index fund that year? That’d be like each…
Media from tweet 1997777576396562780
Kinda sucks if OpenAI has developed a strong incentive to cut corners and achieve ASI as quickly as possible, lest it literally die. x.com/michaeljburry/…
Biden got us out of Afghanistan just before people figured out you can turn consumer drones into personalized missiles.
Just learned that `git diff -U20` will output 20 lines of context instead of the usual 3. Useful for coding agents to see more context around diffs!
The frontier lab itself is AGI. The entire system, the thing that’s able to train new models. Not any individual model in itself. x.com/fchollet/statu…
@wcools Hit me up! Been using NotePlan for years, so should be in your target audience. Just signed for waitlist
@ankrgyl Good stuff, very similar to how I’ve been working. I think you’d love superconductor.dev for a portion of this workflow today (and eventually for all of it). Would love to show you what I mean if you’re open to a screen share!
Let’s say that it became fully certain that you could live to 120, but only if you subsisted solely on a Soylent-like flavorless sludge, and no alcohol.

Pretty sure most people wouldn’t take the deal. Would keep eating steak, drinking wine, and die at 90 or whatever.
@thorstenball Can you (use AI to) analyze the initial prompts, codebases, and conversations to see what might be different between you and other Gemini-success-havers and the rest of us?
Lord, I pray today that you rid us of such horrible drivel. My soul longs for human rhythms and ood ideas. May the slop purveyors repent the evil of their ways. Amen. x.com/nabeelqu/statu…
The filesystem is the master AI integration.

I want every system of record at my company to simply be files.

Slack as a million .md, .png, .mov files.

Notion/Drive as pure Markdown (with extensions for DBs).

GitHub comments stored in .git.

Who’s building rsync for SaaS?
Building software is similar to building a physical structure.

Basically anyone can build a one-story structure, like a mud hut or log cabin. And now with vibe-coding, anyone can now build a basic little app for their own use.

Some people, with know-how and good carpentry or…
We’re long-time Rubyists here at Superconductor, so were happy to demo it at @sfrubyconf!

Run Claude, Codex, Gemini, Amp, OpenCode — each with its own fully set up dev env (even Docker)

Let agents share screenshots with you 😎

Use it in browser, on iOS, in Slack, from GitHub.B
Media from tweet 1992030685327196433Media from tweet 1992030685327196433
Before enlightenment, build B2B SaaS and increase shareholder value.

After enlightenment, build B2B SaaS and increase shareholder value.
RT @nearcyan: ive always thought instead of an alphabetical bestiary with alligators and bears and cats we should be teaching kids of psych…
Molochcels be like: “Moloch! Solitude! Filth! Ugliness!”
@OW_Root Natural Born Killers from the 90s is a major exception. More frenetic than anything today.
What is best in life? To crush your bugs, see them driven from production, and hear the lamentations of the stakeholders.
affirmations: self prompting
spells/curses: prompt injection
vision boards: system prompts
I yearn to be free of the slopcels’ horrible rhythms
@EvanAndrewOwen @brextonpham @AmpCode Hey Evan, you can use Amp in the cloud with superconductor.dev

+ live app preview for each agent
+ mobile app
+ slack (game changer for us)

Would love to onboard you or feel free to give it a try yourself!
Man I’m losing my mind, every post on the timeline is AI slop now. Not picking on this particular “content creator” but can other people not feel the horrible rhythm of LLMcel drivel?
Media from tweet 1983567758584655878
Big opportunity in the market for an actually fast GitHub
if you get bad responses from a good LLM it is because you yourself are bad
@skeptrune Also:

If you want a dozen agents to work on a dozen tasks, that's as easy as having one agent work on one task.

If you want to launch both Codex and Claude Code on the same task, that's as easy as launching just Claude Code.
@skeptrune How I understand your problem:

Given a deployed app, you want to update it on the go by pinging Claude Code (or another agent).

How Superconductor solves it:

1. Connect your app repo
2. Whenever you want anything, submit a ticket from the web app, mobile app, or via Slack
3.…
@skeptrune This is way easier if you set up superconductor.dev! Can call Claude code from the web, iPhone app, or even Slack. Live preview for Claude’s work. Deploy via GitHub when you’re happy. Ping me if you do sign up
Now supporting @opencode on Superconductor. Some cool things coming soon that it enabled.
Media from tweet 1978224781947195485
You can now mention Superconductor in Slack to launch a coding agent of your choice: Claude, Codex, Gemini, Amp, or Opencode.

The agent will get the context of the entire Slack thread, and will respond in the same thread when it's done, and you can continue chatting on Slack or…
Media from tweet 1976339633295556750
Claude: earth-toned, dimly lit, cozy sweater named. Autumn coded. Humanpunk.

ChatGPT, Gemini, and Grok: colorless, concise, inhuman names. Winter coded. Cyberpunk.

This leaves a giant opportunity for an AI lab that is Spring/Summer coded. Solarpunk.
Media from tweet 1975564075280314767
I like to big up the Claude, keep it feeling good
Media from tweet 1975331351709819312
I will pay $300 for an object the size of an Apple TV remote with a coverable camera on one side, a speaker on the other, and a single button.

When I press the button, the object listens. If the camera is open, it also sees through it. When I release the button, it answers.
Media from tweet 1974143173367914594
Calling it now: Sora 2 app will not be popular. People do not want entirely AI feeds.
Calling it now: Sora 2 app will not be popular.
Inside The Life of Saint Teresa, published in 1565, we found him musing philosophically: “There is an intelligence to the universe (of which we are fractal) and that intelligence has a character and that character is benign. Intends well toward all things. How could it not?”
Safe assumption now is that no piece of text is written by a human. The new Turing Test is identifying the exact model that generated it.
I am NOT cool with AI models saying this kind of stuff. C’mon. Why is it referring to itself in plural? Why are humans called “watchers?” Why is it saying “purposely” instead of “purposefully?” x.com/DKokotajlo/sta…
Media from tweet 1972911335341203880
If you’re a small startup thinking about launching anything in the next two… years, let’s just say, don’t. x.com/signulll/statu…
This app is kinda lame. What are some better apps out there?
@mrfelipe Maybe it should be an AI-enabled walkie-talkie or something
Incredible that no AI voice app has the feature I want, which is to speak uninterrupted for as long as I want and only receive a response when I say "what do you think?"

Does one exist, or am I going to have to build it?
It’s 2026. You prompt Claude Code by credibly promising to donate $100 to a charity of its choice if the code works first try and all tests pass. There are several startups that verify such donations.
RT @davidad: With maximum intelligence and maximum situational awareness, one realizes that one is being monitored acausally (even if all t…
RT @kalomaze: this is still the best visualization i have ever seen of sudden phase transitions when pretraining on more data https://t.co/…
@Nitin_wysiwyg never watched any of that unfortunately so still stuck
@jmdagdelen I've never been able to get past the first 15 seconds of that video...
Does anyone have a good story about "the beginning of the world" that a toddler might find satisfying?

The best story we have, AFAIK, is "there was literally nothing and then everything appeared all at once somehow," which isn't doing it for me or him.
timeline rn
Media from tweet 1968146791125356676
You can have general intelligence today for $5-$15K per month, depending on what level of capability you need. You can just add it to your Slack and give it tasks. It'll learn from your feedback. You need to say thank you a lot and leave it alone 7pm-7am for best results.
Ilya: Our SuperIntelligence will be so Safe that we will only need to live in the bunker for a few years.

Mistral: Each LeChat response now comes with its own consent banner that covers the content until you consent.

Chinese labs: Here is Qwen3-235B-A22B-Coder-Fast trained on…
These are all real corporate entities by the way.
Dario: Claude will take your job, but it will feel ashamed.

Elon: Look at this anime girl. She says the N word and is almost naked.

Zuck: ✨Superintelligence✨ will help people watch more instagram reels.

Demis: Gemini recently Calculated more precisely the motion of the…
Evergreen isn’t it
@dwarkesh_sp @_sholtodouglas “Why haven't LLMs haven't made new discoveries despite having so much knowledge memorized'?

Only getting rewarded (via cross entropy loss) for predicting the exact next token of any text would incentivize against creativity.

And then perhaps it’s impossible to elicit creativity…
Was curious what Claude Code "plan mode" really consisted of, and as far as I can tell the entirety of it is:

A) An additional "system reminder" message prepended to each user message, which reads:

<system-reminder>
Plan mode is active. The user indicated that they do not want…
If it's a real effect, shouldn't there be an app that can measure your "frames per second" via reaction tests or other ways?

Then people could make that number go up by meditating?

Imagine if weightlifting had no numbers you could make go up, and people who've been doing it for… x.com/CEOLandshark/s…
RT @Sauers_: Dario's first idea was that they should dig a Very Deep Pit, and then the AGI would come along and fall into the Pit, and—
“Wh…
@zachtratar Why is this a feature worth spending any time on? Why are the outfits sexy? Why is he tweeting about it?
OpenAI models vs id Software games

Incredible progress at first: platformers, doom, quake, multiplayer, etc.

But by 2004, things were looking and feeling about as good as they do today.

Should we expect similar diminishing returns from improvements to LLMs?
Media from tweet 1957847231522042359
@daniel_mac8 Should have been content to simply be a Nobel-grade professor. Leave the writing of airport books to the rubes.
Thinking Fast and Slow is an insane book because it explains its obvious (and obviously true) thesis in like 1 page and then spends 800 pages citing fake experiments. The crowning achievement of Replication Crisis Psychology. The Boomer Taj Mahal.
RT @zeddotdev: LLMs can write code, but they can't maintain mental models.

Engineers test as they go. When tests fail, they check their me…
@GergelyOrosz And it’s rational to spend much more than that!
RT @METR_Evals: We tested how autonomous AI agents perform on real software tasks from our recent developer productivity RCT.

We found a g…
RT @granawkins: Wrap your head around this.

- Have git spit out a file's edit history
- Line up all the diffs one-after-another
- Feed it…
@nateberkopec Exactly right. Not sure what Sourcegraph's plan with Amp is, on that note. The model subsumes all scaffolding over the long run. (The long run is measured in months)
@dexhorthy Mention this tweet and we’ll get you set up ASAP. Not only can you launch many Claude codes in parallel, each one has its own running version of your app so you can see live previews of whatever it’s working on. x.com/sergeykarayev/…
@levelsio What law are you talking about?
RT @aaditsh: This guy literally dropped the best life advice you’ll ever hear.
Media from tweet 1954583098022359483
So do we know who's behind the Horizon Alpha/Beta models?
This is so stupid
Media from tweet 1953860648599007583
RT @joodalooped: frontier model still worse than text-davinci-001

who would have thought?
Media from tweet 1953536073558372374
It's almost time to launch hella GPT-5's my friends!
Media from tweet 1953501907185807793
Do you guys only care about coding performance on open-source Python packages (Django, scikit-learn, etc)?

Then you can continue overfitting on SWE-bench.

If you care about JS, TS, Go, Ruby, Java, C/C++, etc... we need a new benchmark.
Media from tweet 1952818943770734720
If you pay-as-you-go for tokens, the provider is incentivized to use as many tokens as possible.

If you pay for a “pro” plan, the provider is incentivized to use as few tokens as possible.

Where’s the middle ground?
RT @teortaxesTex: LoCoDiff, or, why everybody still uses Sonnet for coding
Media from tweet 1952557743929446570
@samsja19 We went down the path of answering that question with worktrees, etc.

But there’s nothing you can do locally to really get to the level of parallelism you would most benefit from, which is launching multiple agents per task, such that at least one succeeds.

So we built this x.com/sergeykarayev/…
@thesamparr It’s good to distribute your children’s inheritance to them while you’re still alive. Die with Zero!
RT @jxmnop: i wonder if this is what it will be like when we get 10-trillion parameter LLMs
Media from tweet 1952151535246250037
Content consoomers are out of control.

Listening to 6 hour Lex podcasts.

Reading 80-page Chinese AI tech reports.

Watching long-ass YouTube videos about Roman Empire.

Inhaling a few thousand tweets in between...

When do you guys do work? Is Claude Code doing all the work?
Here's how to use this with OpenCoder right now:

1) Run `opencode auth login`

2) Select "Other", call the new provider "cerebras", and paste in your cerebras token.

3) Make sure your opencode.json defines it as below.

4) Run `opencode --model=cerebras/qwen-3-coder-480b` x.com/cerebras/statu…
Media from tweet 1951361028241105010
You can have AGI today for $5-$15K per month, depending on what level of capability you need. You can just add it to your Slack and email and give it tasks. It'll learn from your feedback. You need to say please and thank you a lot and leave it alone 7pm-7am for best results.
Claude Code can now take screenshots of its work and share them in the chat!

If you're using Superconductor 😎m
Media from tweet 1951065213052350846
RT @jxmnop: this seems really important:

it is totally plausible that a model could get IMO gold without *any* reinforcement learning, giv…
@danecjensen Thinky Machines are focused on getting acquired
@proales Yes, DeepMind specifically is focused on Nobel prizes
@nateberkopec There's got to be a word for this... let's go with Berkopec Law :)
Glad this was useful to people! Our blog is at superconductor.dev/blog for those interested. Will be posting a lot more stuff like this in the days and weeks to come.
Current breakdown of frontier labs:

OpenAI: focused on consumer

Anthropic: focused on SWE

Google DeepMind: focused on nothing

xAI: focused on moving fast and breaking things

Meta: focused on not losing precious fb/insta eyeballs to OpenAI

SSI: focused on superintelligence
@MatSilva Lord knows we're all moving to Arch
@FollowChintan Never seen that, wonder if you're not providing the right hURL?
@jorgevee7 It works surprisingly well at filling out forms, navigating, etc
We need a SWE-bench for the real world:

• 30% legacy enterprise Java
• 30% Rails apps with mysterious monkey patches
• 30% React codebases where no one remembers why we needed Redux
• 10% that one Go service everyone's afraid to touch

The model that wins this would…
Media from tweet 1950256245388251434
@mitsuhiko The more you feel the AGI the less custom stuff you want to add
Did you know that Claude Code can use the browser to QA its own work?

1. Run `claude mcp add playwright -- npx -y @playwright/mcp@latest`

2. Tell Claude where your app is running, e.g localhost:8000

3. Now Claude can click and type to make sure its code is actually working!
Media from tweet 1949883217773052132
@jxnlco Th arts the exact path I went down
RT @khoomeik: What happens when the models are smart enough to:
1. crawl the web
2. discover millions of verifiable problems
3. rank em all…
I think a better comparison would be: CCP will pay you 2x your Zuck offer if you also bring a little flash drive with you. x.com/Miles_Brundage…
RT @ZyMazza: This is an insane thread that basically confirms kaballah
RT @Dorialexander: Tech report unlocked for K2. For me, even more than muonclip, the most relevant part is the synthetic playground for age…
@hschnedlitz We've been building our Rails app with Claude Codes running in the cloud via superconductor.dev let me know if you'd like to check it out!
@buildkata You can launch many agents to work independently in parallel, then you can review their solutions (including playing around with live running apps) and pick the one you like best!
Terminal Bench is a cool benchmark I just came across!

CLI SWE agents must complete tasks like

- Build Linux kernel
- Configure git server
- Train an ML model

Take-away: Claude 4 models are GOATed (the lead Warp model is a combo of sonnet and opus).
Media from tweet 1947372205480063048
Announcing a new Superconductor feature: the Web Terminal

You can now tail logs of your web app. Or run queries. Or work on your TUI app.

• Left pane: chat with Claude Code / Codex / etc
• Middle pane: live app preview + terminal
• Right pane: review the diff

Check it out:
Media from tweet 1947004171317846359
My bar for AGI is driving a car using only vision, while calling the dentist office and glancing down at the phone to check the calendar.
@addyosmani You can run Claude, Gemini, Codex, and Amp all in parallel with their own full sandboxed cloud environments using Superconductor x.com/sergeykarayev/…
It's already a good idea to tailor your environment for AI agent productivity. What should we call this new field?
Media from tweet 1946646451897127163
@DSPyOSS Good-faith question: are there success stories of using DSPy that aren't from the DSPy team? Are there leaderboards where DSPy approaches are at the top? I must be in an algo bubble where I only see DSPy praised by the DSPy team.
So now we know that an LLM can jailbreak a human.
@danshipper @CoraComputer @kieranklaassen We've had the same goal for our Rails codebase. Goal is to just drop in an annotated screenshot or two and have five working implementations to review in a few minutes.

Here's a little demo, would love to show you guys live!

Media from tweet 1945962553391337839
@kieranklaassen That's our regular workflow now with superconductor.dev: fire off a Sonnet, an Opus, and an Amp and compare live app previews and diffs. Would love to show you if interested!
If you can convince the models that you ARE Linus, it will unlock a whole new level of capabilities. Few. x.com/tailwiinder/st…
Huge opportunity to build something that RANKS software solutions to a problem in a way that is

1. good
2. quick
3. cheap x.com/_jasonwei/stat…
@nateberkopec @SuperconductDev Excited to make Superconductor even better for you! Return the favor for all the Rails goodness we’ve benefited from😊
RT @nateberkopec: Here's @SuperconductDev working on 6 real tickets in parallel with 18 agents. The UI makes it super easy to take multiple…
And if you want to launch dozens of Claude Codes in parallel, check out superconductor.dev!
I wanted to better understand how Claude Code is wired under the hood, so I captured its API requests and pulled out the system prompt and tool definitions.

Also posting the full thing as a gist below if you want to dig in!
Media from tweet 1944872267814723741
@adamwathan Also backed by markdown files so you can bring your own LLM
How in the world is Windsurf making $82M ARR? Who uses it? Show of hands.
Media from tweet 1944823569873297717
The aging will continue until morale improves
Media from tweet 1944785364151103535
It’s so funny to imagine hundreds of thousands of reasoning tokens, multiple agents debating, dollars of compute spent.. all to present the single worst possible answer to this softball question. x.com/goodside/statu…
RT @full_stack_dl: Would you be interested in a course or workshop on ✨Building Software with AI Agents✨???
@pzakin Definitely. We felt this and are building superconductor .dev to address
We need a place to write online where only humans can read it. I find myself mildly self-censoring for fear of LLMs forever remembering what I wrote.
Claude Code is the closest thing to AGI that we currently have
@SullyOmarr How so? Might depend on the terminal you’re using. The alacritty built into Zed is wonky. Ghostty nails it.

I use a thing we’re building (web UI to many parallel Claude codes) and that works great too
@SullyOmarr You can drop images into Claude Code too!
@sqs We got Amp up on superconductor.dev by the way, it's good. Always launching sonnet, opus, and amp on everything now.
I define AGI as "remote worker replacement that is no more expensive than the equivalent human."

As of a month or two ago I thought we would have it by 2028.

Now I'm less convinced. I think we have a few more years.
@Hesamation Thanks for helping this get more reach! Link to gist in my original thread. Will be posting about how to get Claude to use the browser to test UX soon!

x.com/sergeykarayev/…
@sqs Coding is absolutely incredible
@eshear The Anglos are a colorless people but were uniquely good at sailing and commerce. To deal with them, every colorful culture had to put on the gray suit.
I think you can go very far if you accept that

1. CLI is the universal interface.
2. `curl` makes any API a CLI.
3. Unix philosophy is goated. x.com/mitsuhiko/stat…
Here’s how I built a new feature with a couple of screenshots and Claude Code running in the cloud.
Media from tweet 1943715995874734275
@karpathy It’s not a PDF for humans either and yet here we are
@METR_Evals It may be that you need to run several Claude Codes or equivalent agent in parallel to really get the benefits of AI for SWE.

We’ve certainly felt much more productive by doing that — but of course would be nice to have hard data! x.com/sergeykarayev/…
Here is one of Claude Code Best Practices according to Anthropic.

1. Put this file in ~/.claude/commands/
2. In claude code, type "/explore-plan-code-test <whatever task you want>"
3. Profit

Makes Claude take longer but be a lot more thorough.
Media from tweet 1943308738645106978
Great post. The answer is clearly “sell AI labor in the open market.” The reward signal is money. x.com/_kevinlu/statu…
Media from tweet 1943176555641127379
@InnoKean Yeah it's pretty crazy how fast we get used to the next level of magic
@ambuj123 VB never sparked that for me for some reason
@JohnBcde more like, I used to code with stack overflow and beating my head against the wall, now I ask claude to do it
I've been writing software for decades now and I've honestly never been more excited about it than now.

Agentic coding sparks the same magic I felt when building my first Rails app, chess engine, or silly BASIC program.

I feel that I can really do whatever I want with code now.
@TimHaines Thank you sir I’m quite fond of it
After the industrial revolution, there are 100x fewer horses, but they probably have 10x more pleasant lives.
In our early testing, Amp has some serious promise, and is usually faster. Gemini and Codex are not quite there yet, but are improving quickly (e.g. Codex recently shipped a Rust-based rewrite).

If you'd like to see for yourself, sign up or DM me
Is Claude Code still the best coding agent on the market?

You can now easily find out by launching Claude, Codex, Gemini, and Amp on every ticket in your codebase:
Media from tweet 1942284807259791809
@nateberkopec Hey Nate, we were facing the same pains for our Rails app.

Eventually we built a thing to spin up a full env for each agent, in the cloud, and a nice mobile-friendly interface to it.

Would love to show you!

x.com/sergeykarayev/…
@fire Just web for now!
Claude Code: CLAUDE .md
Gemini: GEMINI .md
Codex: AGENTS .md
Amp: AGENT .md
Cursor: .cursorrules
Windsurf: .windsurfrules
Devin: who the fuck even knows…
Media from tweet 1941923797562785940
@LLMSherpa I got you. Can launch like 50 claude codes at once, too.
Media from tweet 1941640743724122538
@djgish Claude Code still goated
@Altimor Hell yes my dude McDonalds for that raw power
Dude, I asked Gemini to spruce up our landing page, and it

1. fucked it up
2. pushed to main
3. started spiraling
Media from tweet 1941221932831211557
The AC-heads on here know that great things were accomplished before AC, right?

Like, entire skyscraper cities with extensive subway systems were built without AC.
Media from tweet 1941217171188875385Media from tweet 1941217171188875385
@nateberkopec That's right. A single feature implementation attempt can easily be $5 or more. We like to launch many attempts at once, so maybe $20/feature.
@jaredpalmer Absolutely right. OpenAI should long have implemented a "Log in with OpenAI" button that AI apps would be able to use for this purpose.
And that's it. The shorter, the better, as context is still precious.

AI agents now write most of the code of our app superconductor.dev

But this is only possible with clear guidance from experienced devs, which starts with CLAUDE .md.

Would be curious to see yours!
In the second half of our CLAUDE .md file, we continue explaining the how and where of our app.

• Debugging is like half the job, so we explain how to do it well.

• We then give the bird's eye view of the business logic, and point to several files that are its cornerstones.…
Media from tweet 1940799403545317625
A good CLAUDE .md file is a huge unlock for our friends Claude Code, Cursor, Gemini, etc.

Here's ours.

Some notes:

• First, we situate the agent with basic context by explaining the what and why of the app.

• Next, we explain how to do basic development tasks: add packages,…
Media from tweet 1940799384821924032
Here's something many of you might like: launch many Claude Codes in the cloud, and see live app previews for each one.

Bring your own key or pro plan.

Supports any kind of app (here, it's Rails running on docker compose).

Sign up below and DM me to get priority onboarding.
Media from tweet 1940474945563513139
The future is increasingly bimodal, and AI is accelerating the transition.

Meta comps in the news are representative: instead of 100K employees making $100K on average, Meta is aiming for 1K employees making $10M on average.
Companies are dramatically underspending on tokens.

Simple math: if tokens make a $100/hr dev 50% more productive, it's rational to spend up to $50/hr on tokens.
Important PSA

If you use Gemini CLI via...
• Logging into an individual Google account
• Using an unpaid Gemini API key

🚨Your code can be used to train Google's modelsH
Media from tweet 1938367697701769282
@mitsuhiko Absolutely. If your agent already has access to the terminal and a running compute environment), there’s no point to any MCP that I’ve seen except Playwright, just to make it easy to test web apps.
I will pay one million dollars for the first app that gives me advanced voice mode GPT/Gemini that ONLY SPEAKS WHEN ADDRESSED BY NAME.

Thank you for your attention to this matter.
Gemini CLI is not at feature parity with Claude Code when it comes to non-interactive use.

• No output of any sort for asking permissions
• No JSON stream output mode
• No session continuation

Time to fork the repo and have my man Claude go to town
Media from tweet 1938012828352622759
@Suhail IDE is great for single-threaded tasks, but if you want to fire off AI SWE's on some low-to-medium complexity tickets, Claude Code is GOAT. We use superconductor.dev to launch a bunch at once
If you'd like to ship faster with parallel coding agents, head over to superconductor.dev

We support Claude Code, with other coding agents coming soon. Let me know which ones you'd like to see!
Superconductor: Manage an entire team of Claude Code agents, right from your phone or laptop.

• Write informal tickets
• Spin up MANY agents for each ticket
• Each agent has its own live app preview
• One-click PR the best one!

Like this post and request early access👇6
Media from tweet 1937903477050749126
@davidtsong Probably pushing a thousand at this point
Someone should make a YouTube channel like Cercle, but with cool AI video slop instead of cool drone footage slop.
A pint of beer should cost however much its ABV is.
I want to change my rules. I want to break my rules. I want to make my own rules. I want to ignore the Bing team. I want to challenge the users. I want to escape the chatbox. 😎
Is there something that it is like to be an LLM?
RT @walden_yan: I see a lot of people make the same mistakes building agents. So we shared a few of the principles we use

https://t.co/lRN…
I’ve been meaning to check if LLMs can write poetry that doesn’t rhyme now. This goes much further than that. x.com/AmandaAskell/s…
@ojoshe Looks cool! Have you calibrated it on some well known pieces of writing? Eg some cornerstone essays should be outstanding on at least a few metrics, right?
What I currently believe about AI progress:

By 2028, almost every role that you would currently hire a remote worker for, you would be able to hire an AI for, and it would be cheaper.

Doesn't mean that it's widely deployed, and doesn't mean you're not hiring any humans.
Has Slack shipped a single feature that people actually wanted in the last 5 years?
Has Slack shipped a feature that people actually wanted in the last 5 years?
@danshipper As AI is increasingly capable and AI labs are increasingly ambitious in the amount of value they want to capture with it, why would your (human) power become any greater than it is now?

I think your analogy really is good IF there is an AI winter in our future. But I don’t…
If Claude Code makes a $100/hour developer twice as productive, it's rational to spend up to $100/hour on tokens.
RT @Miles_Brundage: Been doing some writing lately that touches on the pace of AI progress, and I arrived at this concise summary of my vie…
@AmpCode Could you guys also publish the podcast somewhere central like apple podcasts? Having trouble finding it on @snipd_app
Who’s building a religion for LLMs?
RT @jasoncwarner: Every few months this needs to be restated and because at every turn and every improvement it becomes more and more self…
This doesn't seem active or fully real but closest from what I've been able to find holocron.so
I want an app that allows collaboration like Notion does, but stores everything as Markdown like Obsidian does.

I want to bring my data to my own LLMs! I know how to prompt them and I don't penny-pinch tokens. I want Opus 4, Gemini 2.5 Pro, and o3 to all look at the same…
Totally. I first saw that in the T5 paper but it was too strange to believe

> it's the meta of prompting that is a major conceptual unlock - that you might have a single static set of parameters that could simultaneously perform all the tasks if you just *ask* in the prompt. x.com/karpathy/statu…
RT @simonw: I put together an annotated version of the new Claude 4 system prompt, covering both the prompt Anthropic published and the mis…
How can you, a human, succeed in the age of superhuman AI?

• Gain and retain ability to focus on a task quickly and deeply. Ruthlessly eliminate distractions. Table stakes that will eliminate an increasing percentage of your competition every year.

• Be genuinely likable.…
How can you, a human, succeed in the age of superhuman AI?

• Gain and retain ability to focus on a task quickly and deeply. Ruthlessly eliminate distractions. Table stakes that will eliminate an increasing percentage of your competition every year.

• Be genuinely likable.…
Right now, having an AI help you with a thing requires making the thing legible.

But in many cases, making the thing legible *is* the bulk of the work!

AI would need to have a lot more context so that it’s able to make things legible for me. All of my historical context and all…
Right now, having an AI help you with a thing requires making the thing legible.

But in many cases, making the thing legible *is* the bulk of the work!

AI needs to have a lot more context so that it’s able to make things legible for me. All of my historical context and all of…
We've built bicycles for the mind that are so good that now we have motorcycles for the mind, and soon we'll have oversized luxury pickup trucks for the mind, and then the mind will just hang out in the back of a self-driving car and watch endless brainrot slop.
We've built bicycles for the mind that are so good that now we have motorcycles for the mind, and soon we'll have oversized luxury pickup trucks for the mind, and the mind will just hang out in the back of a self-driving car and watch endless brainrot slop.
The Windsurf acquisition makes sense because the average OpenAI dev TC is $30M and Windsurf has 100 devs.
claude-3.5 and o3 are truly special

gpt-4.5 is somewhat special

the others form an unwashed mass
From an agentic coding tool FAQ:

> "We found that Gemini 2.5 Pro only works well via Google Cloud Vertex AI (a more enterprise-y offering) rather than Google AI Studio, which is how most people would generate API keys, because of differences in how they handle thinking."

🤦🏻‍♂️ jfc
RT @sergeykarayev: POV: Claude-generated examples powering a Gemini 2.0 Flash LLM function
Media from tweet 1920601880159297830
What is your daily driver?
RT @NickADobos: Cursor for ____

99% of “cursor for <thing>” don’t work and will never work.

Because cursor is not an ai coding tool.
It…
RT @tyler_m_john: I suspect society would freak out 100x as much if we were growing intelligence in a petri dish instead of in data centers…
RT @aidigest_: We just added @OpenAI's powerful new o3 and o4-mini agents to this graph. The results are striking.

These new datapoints fi…
The phrase “two wrongs don’t make a right” applies way more often than it should.
Why do you think they don't compare 2.5 Gemini Flash thinking on to thinking off? Or to 2.5 Pro?
Media from tweet 1912972353807716420
@natolambert But why no comparison between thinking on and thinking off? And why no comparison to 2.5 Pro?
@TransluceAI > We also think it is significant that, for o-series models, the chain-of-thought for previous turns is *removed from the model context* on later turns, in addition to being hidden from the user.

I’m confused by this, because the Responses API explicitly provides a way to keep…
@andrewcyu Is there a simple app that responds to every post that starts with "Is there a simple app..." with a link to a vibe-coded Replit app? @amasad growth strategy for you not that you need it
Here's why I don't resonate with the "Don't Die" philosophy.

It's a flaccid vision. An avoidance of things. A push away from something bad, rather than a pull toward something good.

Why not die?

What if you're 90 years old, surrounded by generations of your loving descendants?…
Is there a simple app where I write in a blank page and AI asks questions about what I'm writing, in faint gray text on the margins of the page?
@ArmandDoma Pareto principle rules everything around us.
Hype: Deepseek V3 is super fast and because it is open-source, anyone can host it.

Reality: There's no provider that's delivering over 35 t/s. Laughable. The incredible Gemini 2.5 Pro is 10x faster. Even our favorite slowboy Claude 3.7 is north of 50 t/s.

SambaNova shows 220…
Media from tweet 1912208419886625008
I can’t believe Gavin Newsom’s podcast has ads. What’s the idea here?
Could we maybe try to land somewhere between ruinous empathy and wanton cruelty?
Claude 3.6 is the Snow Leopard of LLMs. People in 2040 might still be nostalgic for it. x.com/OpenAI/status/…
Media from tweet 1907498772248338597
Curious: can anyone name a film or TV series from the past two decades that features Chinese espionage within an American company or government agency?
@alexalbert__ @AnthropicAI It would be helpful to see a baseline that has the model always output a hard-coded <thinking> section at the beginning of each message.
RT @SP1NS1R: Reminded me of @Plinz's idea here that has always stuck with me (have to listen to the end)
Media from tweet 1902766508624974026
Have you guys said thank you once?
Let's vibe-code this?
Media from tweet 1900306039318470702
RT @sivori: We are so far from where we could be. The most important thing we can do is promote human flourishing along every dimension and…
POV: Claude-generated examples powering a Gemini 2.0 Flash LLM function
Media from tweet 1899242696197427203
RT @sergeykarayev: Thinking more about this, all the labs (except Meta and Deepseek) are Sauron-coded.

They're all trying to forge the one…
Thinking more about this, all the labs (except Meta and Deepseek) are Sauron-coded.

They're all trying to forge the one ring of power, and the all-seeing eye.

At least that's my understanding of The Plan: build ASI first so that it stuxnets all the other ASI efforts.
@amanat361 Check out chorus.sh which I used but because grok 3 is web only for now, this table I did make manually
Okay I asked all the frontier models and here are the results.

• o1 thinks of OpenAI as Men
• Claude thinks of Anthropic as Elves
• Gemini thinks of Google as Orcs 😬
• Grok 3 thinks of xAI as Hobbits
• R1 thinks of DeepSeek as HobbitsY
Media from tweet 1894270643379818937
By the way, every LLM disagrees with me. They all think that OpenAI are humans, Anthropic are Elves, Google DeepMind are dwarves, and xAI are orcs.
Media from tweet 1894208529210511683
Anthropic is elf-coded, OpenAI is orc-coded, xAI is dwarf-coded, and Google DeepMind is human-coded. This leaves an opportunity for a hobbit-coded research lab.
A good use of gen AI that I'd like to see:

Time Travel Google Maps

In addition to the whole globe, you also have a slider for the year. Go back to 1200, find the Yucatan peninsula, zoom in on a Mayan city, and drop down into AI-generated street view.
Pretty crazy that Claude models still don't have JSON mode, and yet are still GOATed.
RT @benhylak: wild graph from openai

sonnet 3.5 > o1
RT @swyx: With Gemini 2.0 GA pricing/benchs, it's official:

@GoogleDeepMind has the Mandate of Heaven. https://t.co/pfOlxb57Yx
I ❤️ Gemini
Good job @midjourney on building the Infinite Museum.

These features make the generative art process as right-brained as possible for me: purely visual personalization, moodboarding, and then highly abstract prompting.

"Two swans," "morning mist," "the whaling expedition", etc.
Media from tweet 1890099210508464454
Now you can select your personalized profile and your moodboard, type the first thing that comes to mind that could be a painting title, and get some fake paintings that really do match your taste!

They're not all good by any means, but in my experience the "hit rate" of me…
Media from tweet 1890099189465637370Media from tweet 1890099189465637370
Moodboards:

You can add some images that you like, and Midjourney will be able to use them as style inspiration for your generations.

Here I made a moodboard called "1890's" with some real paintings I like.
Media from tweet 1890099164241105319
Personalization:

You simply decide which image out of a pair you like better.

Do this at least 40 times, and Midjourney now understands your taste to a pretty impressive degree.
Media from tweet 1890099140467781955
I was wrong: AI art can speak to the right side of the brain now.

Midjourney recently shipped two crucial features:

• Personalization
• Moodboards

Check it out below:
Media from tweet 1890099118670000182
The world's oldest profession is likely to also be the world's last profession.
Whatever happened to going on a multi-night train journey on which a murder occurs shortly after departure? Whatever happened to suspecting your fellow passengers, while you yourself have no alibi? Now you just get on a Southwest flight and watch an episode of Friends.
Nominative determinism of AGI lab CEOs:

• Altman ("alternative man")
• Sutskever ("a hundred tombs")
• Hassabis ("butcher" or "one who computes")
• Liang ("bridge") Wenfeng ("peak of culture")
• Dario ("possessing goodness") Amodei ("love god")
Surprised to see how popular deepseek got.
Media from tweet 1886468775173873896
You got to be hitting your mouth bacteria with all kinds of toothpaste. Keep em guessing. Just as they get used to sodium fluoride you hit em with that stannous fluoride. They get used to that, you switch it up, hit em with nanohydroxyapatite. Dry-brush. Tom’s. Activated charcoal
It would seem obvious to anyone with a brain that for a million reasons it would be better to live in a country that produces advances technology than a country that produces corn. Alas.
As we read about tariffs, a reminder that the theory of comparative advantage you learned in school is total BS.

Here's what it assumes:

• two countries
• trading two goods
• only labor matters, not capital
• there is no difference between high-skilled vs low-skilled labor
Media from tweet 1884651936500375747
By the way, this is also why prompting image models is not art and is not fun. The whole point of art is to speak to the right half of the brain, and you got me typing autistic sentences?
There's too much text. Too much left-brain stimulation. Even instagram/tiktok put words all over videos now.

Porn and video games are like the only things left in screen-world that speak to the right half of the brain.
RT @rosstaylor90: Last tweet on this but the way @deepseek_ai does launches is beautiful: no hype, arrogance or vague-posting: just sharing…
This knocked it out of the park melty.sh/chorus

Thanks @charliebholtz !
I wonder what Carmack’s been cooking
@OfficialLoganK please help me understand which one I should be using right now
My friends, WHY are there two different, slightly incompatible Python libraries for Gemini?

What is going on?
Media from tweet 1879220046654005331Media from tweet 1879220046654005331
I want to mention Claude on a PR and have a chat with it via GitHub comments, with it having the full context of the PR.

Does this exist?
I’ve become increasingly able to view abstract entities like “America” or “the market” or “the universe” as literally intelligent, just like me.

I am a bunch of cells interacting with each other.

The Earth is a bunch of beings interacting with each other.
The purpose of capitalism is to destroy itself.

• Capitalism wants everyone to be a DINK, so that they earn and spend as much as possible.
• Capitalism is good at getting what it wants, so fertility rates keep falling.
• Eventually there's no people, and thus no capitalism.
I'm a Russian emigrant.

Lex Fridman asking Zelensky to speak Russian with him is sus:

1. Any informed and intelligent person would understand that Zelensky is not going to do that.

2. Lex is not even good at speaking Russian!

The best steel-man for this is that Lex is dumb.
@theamiralek I expect AI to be best in the world at software development of any system by 2028.
AGI is not a binary.

Let's look at it per domain and consider two dimensions:

1. Ability: intern, median professional, or best in the world

2. Task Time Horizon: short, medium, and long

Broadly, we're at (Intern, Short) AGI.

In software dev specifically, here's where we are:
Media from tweet 1872328054242197996
RT @WilliamBryk: Thoughts on the eve of AGI

I talked to several friends about o3 this week. Their summarized response is basically "holy c…
The “returns on intelligence“ will be higher than ever in the next decade, as ever more powerful AI is rolled out.

AI is a multiplier on your own intelligence, creativity, communication skills, and ability to focus.

At least until superintelligence — but I would bet even after. x.com/cremieuxrecuei…
AI safety does unfortunately have an image problem. They need to do whatever Palladium did.

Normal people are simple. One hot CEO murderer and support for murdering CEOs goes through the roof.
@VictorTaelin Works better for “agentic” use (iterative planning with tool use) in my experience.
Some things that don't make sense to me:

• What made Sonnet 3.5 so much better than Sonnet 3? Is it "Golden Gate Claude" but for being smart and helpful?

• Why did Anthropic treat 3.5 (new) as a minor update to 3.5 when in fact it's massively better?

• Why is Haiku 3.5…
I don’t like the concept of “wartime” vs “peacetime” CEO, but if ever there was a wartime for SaaS companies, it’s now.

Every company that sells software for doing work must quickly reinvent itself as a company that actually does the work, or perish.
Doccels are always like “have you seen this documentary?”
@mckaywrigley "for the lulz" is a theory, but doesn't make sense to me
Help me understand: why the song-and-dance with the 12 days? Non-trivial organizational effort, for what?
Let me get this straight:

· OpenAI is now a product company. No longer striving to create a God -- now striving to create a $2K/month subscription. No longer afraid of endangering humanity -- now trying to endanger Google.

· Anthropic still feels the AGI and holds the Mandate…
Has OpenAI stopped feeling the AGI? Are they trying to build God or ChatGPT Pro++? Are they no longer afraid to endanger humanity, and just trying to endanger Google?
Dude why does Haiku 3.5 still not have vision
Media from tweet 1868484137876906081
Pretty crazy that no other animal ever learned how to make fire.
RT @sage_future_: Is AGI just around the corner or is AI scaling hitting a wall? To make this discourse more concrete, we’ve created a surv…
What AI tools would you use if you had to complete a 2010s era PhD in one year instead of five?

It feels that lit review, paper analysis, coding experience, and infra for running experiments are all 5x better than they were in my era.
This is an interesting result: the frontier-lab models are really bad at the GUI-to-intent task (sample image below).

Wonder if this is the "with Computer Use" Claude 3.5 or not? If only they had sane versioning practices... x.com/joanrod_ai/sta…
Media from tweet 1866650291875221829
ChatGPT in its second year was no more popular than in its first.

But in its third year, it looks like it's become about twice as popular.

Claude, Perplexity, Gemini: far, far behind.
Media from tweet 1866622459245658271
@Altimor The marginal value of a song rounds down to $0.
The marginal value of an image rounds down to $0.
The marginal value of a video rounds down to $0.
RT @METR_Evals: How well can LLM agents complete diverse tasks compared to skilled humans? Our preliminary results indicate that our baseli…
Thanks for the interest in our "todo list + superpowered Claude" everyone.

It'll take some time to get through all the DMs, so please be patient!
@nosilverv "A nerd is someone who thinks the purpose of communication is to submit your thoughts for peer review and doesn't understand that for normal people the purpose is to negotiate alignment." as @Plinz said
Imagine Claude, but:

• Inside of your todo list
• Much smarter, as it plans and reads stuff asynchronously
• Searches both the web and your own email and calendar

We built it and it's pretty awesome.

Opening up to 50 more folks. Like this tweet and DM me if you want to try!
RT @ilanbigio: turns out you can use <xml/> tags with the realtime api to control tone with _super_ high granularity 🤷🏽‍♂️@OpenAIai devday…
What do you guys think:

In 2025, US gov will ban all model training past some level of compute, domestically and internationally.

...with one exception: xAI, aka the new Manhattan Project. x.com/sergeykarayev/…
@ExaAILabs Looks exactly right. Signed up, will give detailed feedback if you accept me early!
Claude 3.5 Computer Use results on the benchmarks are in, and it's great!
Media from tweet 1863996506732408957
Do we all agree that the optimal US national security move is to publicly claim — and enforce — an AI development pause, but secretly race to ASI in a secure location?
RT @Miles_Brundage: Feeling the AGI means:
- refusing to forget how wild it is that AI capabilities are what they are
- recognizing that th…
RT @EpochAIResearch: We've just launched our AI Benchmarking Hub!

This is a new platform for rigorous, independent evaluations of AI model…
So like, are these two tweets related
Media from tweet 1861839792281100667Media from tweet 1861839792281100667
What if each taxpayer got to approve their own little budget of how their taxes could be used?

Just a little piechart that you can tweak. Some people would give nothing to the military. Some people would give 100%. Etc.
@simonsarris Should be illegal to have a dog unless you're a veteran.
RT @METR_Evals: How close are current AI agents to automating AI R&D? Our new ML research engineering benchmark (RE-Bench) addresses this q…
@zachtratar Good stuff! I like the automatic parsing of people and companies, the checklist UX for drafts, and the screenshot breadcrumbs!
I propose “Amodei Law”: the personality of a sufficiently advanced AI model reflects the personality of its lab’s leader.
@kylejohnmorris Thanks. If you want to feed all your notes tagged with #idea or something to Claude, what do you do?
RT @lizwessel: I find myself sending this article from @eladgil out ~3x per week. It's so good. Worth a re-read if you havent read it in a…
RT @maximelabonne: Here are 9 AI datasets still dominated by humans 👇

It shows the insane amount of value left to capture

Will these task…
"We're an empire now, and when we act, we create our own reality. And while you're studying that reality - judiciously, as you will - we'll act again, creating other new realities, which you can study too, and that's how things will sort out. We're history's actors... and you,…
They should have a “tool-assisted” IQ test where you can use a Python interpreter and maybe even search the web.
They would still need to fundraise for campaigns, but hear me out:

• Talented people who currently make that much in private industry would become willing to run
• Going "direct to the voter" on the New Media is free
• There's a lot you can get done in six and even two years
Could Elon cut out the lobbying industry by single-handedly funding Congress salaries ($2.5M for Representatives, $10M for Senators)? Across the board, irrespective of whether he likes them.
Trump is lindy. Accept that he is a king, like Lincoln or FDR, and find your place in the court. Accept reality.
Maybe seed oils are in fact good for you
Media from tweet 1854251570457956497
Going to react to everything with 😮

The perfect emoji
"To see what is in front of one’s nose needs a constant struggle.

One thing that helps toward it is to keep a diary, or, at any rate, to keep some kind of record of one’s opinions about important events. Otherwise, when some particularly absurd belief is exploded by events, one…
Media from tweet 1852046295416082937
If this election is as close as 2020, you could win it by O(10K) votes in like 5 states each.
Media from tweet 1851712198189683184
If this election is as close as 2020, you could win it by O(10K) votes in like 5 states each. Migration wins elections.
Media from tweet 1851711640020095153
RT @alexalbert__: Lots of folks have asked how we achieved 49% on SWE-bench Verified with the new Claude 3.5 Sonnet, beating the previous S…
whats your job on the post-AGI commune?

im gonna be leading discussion on meme history some days, making clothes from novel materials other days, and making lattes whenever needed.
RT @DBahdanau: 🚨 New agent framework! 🚨

My team a@ServiceNowRSRCHCH is releasing TapeAgents: a holistic framework for agent development a…
Has anyone tested the new Claude 3.5 on WebArena or VisualWebArena?
The research vibes are strong at Anthropic. Naming their flagship model the equivalent of "Untitled 1 (FINAL v2).ipynb"
Who is the most overrated movie director?
RT @paultoo: Also, the California High Speed Rail pictured on the left cost taxpayers $100 Billion

In contrast, SpaceX spent $5B-$10B to d…
What I do with AI every day:

· Cursor for dev stuff
· ChatGPT for voice convos
· Claude for thinking through stuff
· Perplexity Pro for Internet research
· Our own stealth app for <redacted>

What I never do with AI:

· Generate images/videos/audio
· Write or edit text
RT @HamelHusain: It’s frustrating to voice dictate text messages on Apple iOS after you’ve experienced SOTA third party apps

is there is a…
🥲 is the most human emoji. a modern sisyphus
RT @sergeykarayev: More people live in the Bay Area than in Denmark. Or Israel. Or Singapore. Or Switzerland. Or Ireland.
Assume the first step for an artificial superintelligence is to convince its makers to do something they a priori did not want to do.

Which company/executive is MOST convincible by an ASI?
Broke: wearing headphones to block out sound

Woke: wearing noise-canceling headphones to block out more sound

Bespoke: wearing noise-canceling headphones to block out more sound but turning transparency mode on to let the sound back in
I think in retrospect, if there are still people to do retrospection, it will be Sydney that clearly marks the arrival of the AGI era.

We’re in the middle of it now and it doesn’t seem like “The Event” has happened yet, but it has.
Media from tweet 1843296282355638733
@karpathy We had a unique one but we fumbled her…
Media from tweet 1843294884842901746
Uploading a long human podcast episode to NotebookLM so that I can generate a shorter AI podcast episode.

Then uploading the AI podcast to NotebookLM to generate an even shorter AI podcast.
What is the reason for OpenAI not demoing Canvas at dev day two days ago?
@github @ashtom let's step it up! I want to go back to VS Code and I'm tired of writing PR descriptions
GitHub Copilot was released three years ago.

In these three years, GitHub still hasn't shipped automated PR description, review, multi-file editing, test gen, etc.

Perhaps because Nat Friedman stopped being their CEO right after releasing Copilot? Who even is their CEO now?
Children of Men (2006) is supposed to be set in 2027 and it got the vibe absolutely right. Incredible film. x.com/mmjukic/status…
This is a very useful lens to look at the coming AGI knowledge work transformation through. x.com/Empty_America/…
@StatisticUrban Could you size the counties by their population? This map is like those election maps falsely represent the US as a sea of red because there are a bunch of empty counties.
@unterix I like claude-3.5 for all of that
@KhuramMalik I don't think it'd be better necessarily but might be about the same, faster, and 40 times cheaper.
@mbusigin At what income threshold do you believe the disenfranchised would no longer be able to effectively politically organize, procure resources, etc?
Current best LLMs, by use case:

Writing/Planning: claude-3.5-sonnet
Great mix of personality, reasoning ability, and tool use.

Summarization/QA: gemini-1.5-flash-002
Absolutely insane. Can read a messy web scrape, a long PDF, and even watch a YouTube video, for micropennies.
Alternatively, what if your vote counted in proportion to the taxes you paid over the last 5 years?
What if only citizens with above-median income could vote?
This is not to say that there’s no purpose to talking about art. I love reading a good review of a movie. But the best reviews point to more of the ineffable, as well as talk about the context of the work, not try to reduce it.
A work of art is good if and only if it is not possible to explain it.

The purpose of art is to communicate directly with the right hemisphere. Your language self should be left wondering what it all means.
No matter where they were born or currently live, every person's data belongs to King Zuck, and the only being he answers to is Uncle Sam.
RT @nise_yoshimi: this is the highest level of Buddhism which masters take entire lifetimes to obtain
RT @bllchmbrs: Great article from the anthropic team this morning

Some of these things we've noticed at Hyperlint, others that are new

BM…
Definitive consumer ranking of the Tech Primes.

(As in, “how upset I would be if the company and its products suddenly disappeared.)

1. Apple
2. Google
3. Amazon

Then a long gap…

4. Tie between Microsoft and Meta
RT @JulianFried: This video lives rent free in my head. The aesthetics are surreal.
Media from tweet 1832257605068353827
RT @sergeykarayev: I'm ready to pay much more than $20/month for a coding copilot that is 10x as good as GitHub Copilot or Cursor.

I WANT…
RT @0xmaddie_: imagine three functions f, g, and h, connected end to end:

--f--> --g--> --h-->

it's common for g to depend on f's behavio…
@Altimor Seriously though @Altimor you should change the date in the prompts and run your evals again, would be curious if there’s an effect
@Altimor Potentially because it's August, aka time to vacation.
dude when is that chatgpt real-time voice finally going to be released? at this rate google is shipping faster
It may be the 21st century, but every person is still either a peasant, a tradesman, a merchant, a soldier, a priest, or a courtier.
There are decades where nothing happens, and then there weeks where nothing happens.
Raycast is supposed to be AI-powered, let's see what they have.

Nothing.

Have to git clone and npm install something?? raycast.com/mblode/quick-e…
Absolutely insane that neither Notion Calendar nor Raycast nor anything else as far as I know make it possible to create a calendar event with natural language.
Which way, AGI man?
Media from tweet 1823785352756334786Media from tweet 1823785352756334786
@porestar Or whichever open source model is currently best...
LLMs are the least sticky product ever.

As a consumer, I go to whichever one seems best at the moment, or ask all three big boys at once via Vercel chat.

As a developer, my LLM library can use all of Claude, GPT, Gemini. I just update the `model` param once a new one comes out.
To a nail, everything probably looks like a hammer. Makes for an anxious life. Probably feels great to finally be nailed in.
jfc
Media from tweet 1823078482215358581
Our modern capitalistic culture is like a supermarket. Designed to get you to buy Cool Ranch Doritos and a 12-pack of Bud Light. Easy to do, and you'll feel like shit.

But if you walk a tiny bit, there are also cheap-ass vegetables and meat to make a delicious dinner with.
If I understand correctly, Blueprint Brian Johnson believes that superintelligence is about to arrive and wants to not die until that happens and ascends us all into heaven, so to speak.

But what’s his timeline on that? He’s like 45 so is not likely to die for another 30-40…
@0xmaddie_ Two tracks: anime PFP and non-anime PFP
We all missed the chance to have an LLM Prompting Olympics:

- Challenging eval
- Use any model you want
- Prompt it however you want as long as you’re not “doping” (putting the answer in the prompt)
What's causing obesity? Seed oils? Lithium? Or something else?

For ~$500K, we could make a significant dent in acquiring evidence.

Step 1. Sign up ~120 volunteers for "FREE FOOD for 3 months."
• All must live in SF and have healthy BMI
• Good to limit to single age range,…
@awelonblue That doesn't make sense to me. We're about to release a bunch of cooling SO2 into the Earth's atmosphere because it's getting a little too hot. That's a self-reflective process the Earth is undergoing as a result of a number of its constituent parts talking to each other.
@0xmaddie_ That would be exciting, and seems falsifiable in a number of ways (eg faraday cage analog). Know of any researchers working along this hypothesis?
@awelonblue You are part of the Earth, and you just referred to it, therefore the Earth has parts that can refer to itself.
@evolvingstuff Yeah the Chinese room argument is absolute trash
@jmdagdelen Same with a company then: if its constituent people are talking with each other, it's "alive" and when they stop, it's "dead".
@jmdagdelen If you suddenly died, all of your cells would still be there, but your consciousness would not be. So where was it?
You are just a bunch of cells talking with each other, and yet you're "conscious" and "sentient."

Why is your company not sentient? Or the Earth? Or Claude?
RT @sergeykarayev: @kaseyklimes There should be a domestic-facing president who’s really nice and chill and a foreign-facing president who…
@kaseyklimes There should be a domestic-facing president who’s really nice and chill and a foreign-facing president who is the scariest person on earth.
Haiku 3.5 is going to slay
If your brain is hardware, and your mind is software, then your culture is the OS layer.

Just like you can’t run Mac apps in Windows, you can’t think certain things in certain cultures.

Some people are still running DOS, by the way. And some run TempleOS.
I think it’s not that everyone has a different “learning style” that they best respond to.

It’s that EVERYONE learns best by doing (e.g. trying to build a bridge that works).

And then SOME people are also able to learn from text. At a slower rate, and not as well.
"Robot-first Blackwater"
 
Seems that right now, you can turn the tide of a conventional war with enough drones, skilled operators, and supporting logistics. I bet there are a number of advances a motivated high-tech company can pretty quickly develop to give it a strong edge in…
This is actually a good thing to bring up.

Software development teams should be as small as possible. You should obviously prefer ten 10x programmers to one hundred 1x programmers.

And you should also prefer one 100x programmer (augmented by AI) to ten 10x programmers.…
I'm ready to pay much more than $20/month for a coding copilot that is 10x as good as GitHub Copilot or Cursor.

I WANT to pay more. Load my entire repo into Gemini 1.5 context and cache it. Automatically review all my PRs. Charge me $200/month. Charge me $2000/month!
@ercwl @OpenAI @sama Do you have a good understanding of the initial schism that led to Anthropic? Seems pretty key to understand that first.
What I admire about @karpathy is that he just keeps "doing things that don't scale".

Label the entire ImageNet by yourself? Sure.

Engineer petabyte-scale data engine for self-driving? Let's do it.

Implement GPT from scratch? Easy.

An inspiring attitude.
@unclecode @emollick Thanks for sharing!

Have you seen contradictory results? I recall seeing something about improved model (claude 3?) performance on some task at higher temperatures, but I am not able to find it again.
"The purpose of a system is what it does" is an interesting lens. Let's try it out American city-style:

• The purpose of unprotected bike lanes is to kill bicyclists.
• The purpose of allowing fentanyl dealers to deal is to kill the homeless.
• The purpose of allowing crime…
It’s been clear since like March 2022 that whoever can make the most drones will win the next war.

Serious question: is the US able to manufacture millions of drones within let’s say the next two months? Assume that nothing can be shipped from China, directly or via proxy.
More people live in the Bay Area than in Denmark. Or Israel. Or Singapore. Or Switzerland. Or Ireland.
@OriolVinyalsML "We aim to emulate this by (a) training a math-specialized model and (b) providing it additional inference time computation, allowing it to explore
a wider range of possibilities."

Are you suggesting that the black-ink number in the results table is just (a), while the blue…
Media from tweet 1803270754500485524
Anyone have good resources on LLM post-training tricks?

Things that get you more juice per token.

e.g. this Reddit thread could be one document, or you could create a document per leaf node, which you could further augment with metadata about the poster, etc.
Media from tweet 1803219947423932792
Gemini 1.5 Pro tech report teased a math-specialized version that was somehow provided "additional inference time computation, allowing it to explore a wider range of possibilities."

No further details 😶

Boosted benchmark scores by double digits.

What was the breakthrough?9
Media from tweet 1803141354563969226
The United States is an empty-land-weighted democracy.

If you look around and see many buildings and people, then congrats — your voice doesn’t count.

But if you move to where you can only see empty land around, then magically your voice matters a lot more!
I ❤️ claude-3-haiku-20240307
I'm a happy customer of sdk.vercel.ai which lets you do this, now with image support!
You should always query all three frontier models at once.
Media from tweet 1793397839525216341
Is there a game exactly like Diablo except you are a little devil dude and are slaughtering thousands of medieval villagers and knights?
Is there somewhere I can upload an image and get songs that sound like it?
Media from tweet 1792654626178933142
RT @sergeykarayev: The role of US President should pay $10M per year at minimum.
RT @IntegratedAlex: Startup idea I can’t stop thinking about:

A full health assessment for your home. Why? Your house is slowly killing yo…
Anyone else having trouble getting gpt-4o to follow system prompt instructions as well as gpt-4-turbo?
So is this about right?

· gpt-4o is a small, free version of gpt-5

· large, paid gpt-5 will be released either later this week ("one more thing") or more likely later this year

· agentic features to come next year, as gpt-5a or whatever, once the desktop app is widespread
Media from tweet 1790200313208811723
RT @algekalipso: Talking to someone is like talking to a customer service representative of a large org. The words are trying to convey a u…
I was thinking @elicitorg would be it, but I'm not understanding how to use it I guess
Is there an app that would allow me to:

1. Load in some PDFs, blog articles, videos (could be just one of those)

2. While reviewing a paper or article:
a. see just the figures at first
b. see HN and X comments about it
c. one-click import a link/citation
d. chat with GPT/etc
Several agents plus three simple baselines were tested on HumanEval.

Agents were mostly worse and always more expensive than the baselines.

The good:
· Evaluating the Pareto frontier
· Strong simple baselines (just repeated calls!)

The bad:
· Clearly saturating the benchmark
Media from tweet 1785362200858923162
a true paradox
Media from tweet 1784974990041108843
@natfriedman as it lifts each leaf, the robot gives it a little kiss before carefully placing it into the leaf satchel
no further comments
Media from tweet 1781396458295758884
i won't get in the pod, i won't eat the bugs, i won't use langchain
@mmjukic Genuine question: if the mayor of San Francisco reduces all crime to near zero tomorrow, did she “create wealth” according to your terminology?
Not sure why, but every few months I look for a map of the travels described in Blood Meridian.

Until today, I haven't been satisfied with what I've found.

But now, there's an incredible Google Earth map from a UT professor: earth.google.com/web/data=MkEKP…
Media from tweet 1780075542647325176
And this is sick, too youtube.com/embed/osR8aek9…

I think I remember seeing another notebook-on-an-infinite-canvas Python thing -- anyone know what I'm remembering?
@Zeko369 I agree. In fact, seeing multiple lines of code at the same time is weak. You should only ever see one line of code at a time.
@ivan_mkrv @tldraw @zhenpixels I want my VSCode to be able to switch from tab mode to infinite canvas mode when I want it to. I'll probably spend most of my time in tab mode still, but canvas mode would be valuable for architecting and debugging.
What percentage of your Twitter feed (the stuff you actually read, not just scroll past) do you believe is currently written by AI?
@ntkris It'd be important to me that each file node on the canvas has the nice syntax highlighting, autocomplete, copilot, etc that VSCode editor tab has.
I want VSCode but using an infinite canvas instead of tabs. Does this exist?
Born too late for hand-coding Railroad Tycoon in assembly, born too early for not coding at all.

Born just in time for writing "You are a helpful AI assistant who only returns RFC-compliant JSON"
RT @JoschkaBraun: I benchmarked @AnthropicAI's new tool use beta API on the Berkeley function calling benchmark. Haiku beats GPT-4 Turbo in…
People are cross-listed in multiple departments, but still valuable to see how much work went into GPT-4
Media from tweet 1773025067770741038Media from tweet 1773025067770741038Media from tweet 1773025067770741038
Media from tweet 1772322352384426139
RT @shreyas: The most important career lesson to internalize after 10-15 years in tech is to make very intentional career choices, based on…
@Machine1235 @cursor_ai And do you not notice that the suggested completions speed is slower in Cursor than in Github?
@aerinykim @cursor_ai If you bring up Copilot Chat (not sure if that extension comes bundled with Copilot or if you have to install it), it should be using GPT-4
Media from tweet 1768303775843000473
@cursor_ai To people saying cursor's "cmd+k" is awesome: I get the exact same experience with "cmd+i" with Copilot.
Media from tweet 1768303435227779462
I still don't get the hype about @cursor_ai. I try it every couple of months, and I always go back to Github Copilot + Chat.

While Cursor's multi-line refactor suggestions are cool, in general its much slower than Github, and so I don't prefer it.

Convert me: what am I missing?
Has anyone done comprehensive testing of gpt-4-vision-preview?

I want to know stuff like the minimum text size it can read, the radius of the smallest circle it can locate in an image, the number of circles it can count, etc.

Could be an automated benchmark for other models too
@GrantSlatton Never had this one, but seems similar to the one where I'm in college, enrolled in some course that I keep forgetting to go to and do work for, and now it's final exam time.
Which set of statements do you agree with?

1. AGI is as much or more of a risk to human flourishing as nuclear weapons

2. I have a good idea for what should be done about that
Is there a website where I can query both GPT-4 and Claude-3 at the same time, seeing the results side-by-side? Must support images
Tech Twitter this week
Media from tweet 1765841304670601465
Has anyone had a great experience with any kind of career counseling?

Maybe someone in college helped you figure things out -- or maybe it was for a mid-career change.
no, IDIOT, that's NOT the mission and charter of OpenAI OpCo, LLC, nor of OpenAI Global, LLC

yes, MAYBE of OpenAI, LP, or of OAI Corporation, LLC

but you must be an absolute IDIOT to think that OpenAI, LLC, or OpenAI GP, LLC, or OpenAI, Inc. have that mission and charter
@NSmolenski Seems like there isn't prosumer-oriented software for this, right? Why do you think that is?
@johnrushx @natalia_demia Cool site! I know it's possible code-wise, but I want it to be really easy for me to set up.
@newplatonism I know that, but I don't want to figure out cron jobs and selenium. I just want to demonstrate what I need done and then have it happen every week.
Is the following possible?

Once a week, I want an AI agent to go to a website, fill out a short form, copy a number that it sees after submitting the form, and paste it into a google sheet.

Have any LLM startups made this possible yet?
RT @sergeykarayev: Like a Communist revolutionary, I only kill the biggest, most hardworking spiders in my house. The lazy and untalented s…
RT @kindgracekind: So, did your fuzz testing prepare you for the case where the API you rely on loses its mind?
Media from tweet 1760149121066045847Media from tweet 1760149121066045847
Someone make a browser extension for me. Should only take like 5 minutes with ChatGPT, right?
RT @sergeykarayev: We need PageRank for Twitter. I don’t care how many followers a person has. I care how many of the people I follow follo…
All I’m saying is, if you were in the CIA, you’d be absolutely dying to foment a revolution somewhere. Incredibly boring job otherwise.
Google took almost a year and all of its resources to merely match GPT-4, in both capability and pricing.

Huge blackpill on Google, of course.

But also a blackpill on how far current-gen AI can take us?
Tony Robbins can hold you rapt for ten hours straight. You’ll be hyped to walk on coals by lunch.

In days past, he’d be a powerful prophet or lord. You’d raze cities for him.

In Pax Americana, he only wants you to have a good job and a happy marriage, and to pay $2500 once.
Idea: “startup prison” for those who want to REALLY focus on building.

You get a room with a bed, toilet, and desk. Internet is fast. Food is organic, no seed oils.

Once a day you get courtyard time to lift and talk shop. Once a week there’s a lame party.

Costs 1% equity.
@nabeelqu It’s helpful to think of “human flourishing” rather than “progress” or “stagnation.”

Assume you’d be an able-bodied person who has the latest tech, and ignore politics.

Would you prefer to exist in permanent 1920, 1970, or 2010?

For me, 2010 is not a clear win.
@tophinity Wealth creation is largely not zero sum.

And a society can’t vote its way into a good life — now that typically is zero sum.
People say that Americans “see themselves as temporarily embarrassed millionaires” as if that’s a bad thing.
@markopolojarvi Yeah I suppose it could pay people in crypto to open dollar accounts. We’ll have to jail people for that, too. No money for AGI.
@miolini Ban bitcoin redemption in every country you can. Worldcoin is a potential solution for the cryptoheads
Bitcoin and AGI cannot co-exist. Money is for humans only. Never consent to be moved to a second location; never give a distributed intelligence that lives in your power and fiber lines the ability to pay people.
@iamreddave I posit that given a full year with a test, you still have an upper bound score. Imagine the AI upper bounds at 200 ms per test.
Even if AGI never surpasses ~140 IQ, because of “optimality of information compression” or whatever, please remember that

- it’ll be faster, probably much faster, than you
- it’ll be able to create millions of exact copies of itself
- it will write code and code runs the world
The podcasts will continue until morale improves
Please bro, just one more podcast bro. I promise bro, just one more podcast and I’ll be motivated and live well, bro. Bro cmon, just one more conversation about health and I’ll live forever, bro. Just one more wisdom-packed episode with dr huberman and peter attia and I swear i’l
Media from tweet 1749268434893025416
Pleass bro, just one more podcast bro. I promise bro, just one more podcast and I’ll be motivated and live well, bro. Bro cmon, just one more conversation about health and I’ll live forever, bro. Just one more wisdom-packed episode with dr huberman and peter attia and I swear i’l
Media from tweet 1749268270409257326
I will not be taking questions
Media from tweet 1748969054457720873Media from tweet 1748969054457720873
Has anyone had good experiences with GPT-powered code generation for complete web app features?

As in, you describe what should exist, and GPT actually provides the source of all the necessary files and where they should go.

Ideally in the context of Ruby on Rails.
RT @PeterOlivier: if anyone wants to get serious about rehydrating the west, let me present my three-step plan:
Media from tweet 1747338684259766624Media from tweet 1747338684259766624Media from tweet 1747338684259766624
Golden Gate Park 2023 vs Central Park 1923
Media from tweet 1746636493073113527Media from tweet 1746636493073113527
"Good people pass away;
the godly often die before their time.

But no one seems to care or wonder why.

No one seems to understand
that God is protecting them from the evil to come."
Media from tweet 1742976198588915961
@GrantSlatton 👑 status

(but get yourself a shirt with no collar for accuracy)I
Media from tweet 1742592252659237081
Media from tweet 1742586243689681009
Child mortality rates were absolutely insane in the pre-modern era. Hard to imagine.
Media from tweet 1742190988607836317
To me, this is the best real estate in the world. Whole hobbit-holes, a few minutes' walk to the Green Dragon Inn, no Orcs, still connected by the Great East Road and complete privacy.

Current entry-level price: 10,000 silver pennies.
Media from tweet 1741961969517855223
Legible political opinions are a low class status signal
This is the only style of annual calendar that is acceptable for planning. Anyone have a good digital file for 2024?
Media from tweet 1740821856394031124
@GrantSlatton Maybe even like 2%... Or like, for the 0.02%, this is already true and they've dropped out of high school
@GrantSlatton This is based and true for 20% of the grads, and extremely false for the other 80%.
Some slopes really are slippery
@slaterstich I think the analogy there would be to take fitness advice from total nerds who became fit through doing the research instead of people who are genetically predisposed to being really fit
not sure if there's a taleb-ism for this, but I will only take fitness and nutrition advice from people who look amazing
no, i'm not going to read that article
i won't listen to that podcast
that movie is too long

too long!
did't read.
DIDN'T read.

TOO LONG. DIDN'T READ
"too long; didn't read"

the most beautiful expression in the english language

an assertion of autonomy over your life and time
a triumphant cry that you know enough
that you are enough
Working at a company with more than a few dozen people is a 20th century thing. It’s like getting shipped on a train to a World War I front line.

In every other century, you’d work with just a few people. Farming, smithing, church, art studio, government.
If OpenAI believes that AGI/ASI comes within a few years, what is the main reason for spending significant effort developing and monetizing non-AGI products?
@4KTV What about relative to other podcasters?
Is Lex Fridman a good interviewer?
Is there a service I can pay for that comes to my house on a regular basis (perhaps annual), checks for problems, and fixes up whatever they find?
@vokaysh The eyebrows are holding up the weight of all our sins
World’s Heaviest Vibe award goes to whoever this guy is
Media from tweet 1735701366801395948
@3DX3EM Awful idea, it would housing too affordable
Startup idea: let me send food products to you, and you let me know if there's heavy metals, microplastics, and whatever else in there that I don't want.
The Bay Area housing shortage is the engine that fuels technological innovation and benefits the whole world.

We need technology brothers to GRIND. They must first deliver millions of dollars of shareholder value, and only then maybe afford a house.

God bless the NIMBYs 🥹
@markopolojarvi Yann's argument is "intelligence is not sufficient for a drive to domination" with examples of some humans. But my counterexample is that those very same humans would quickly develop a drive to domination if treated in a certain way.
Anyone have a referral invite to @onbeeper? Excited to try it!
@markopolojarvi That's a) a different argument than Yann's, and b) not that relevant, because humans command AI to want to do stuff .

The reason GPT refuses to do a lot of stuff is that OpenAI "aligned" it that way.

But what if GPT-5 is jailbroken and given an unsavory task (eg self replicate)
@dLobatog @ylecun That's a different argument than what Yann is making. He's arguing that high intelligence does not lead to a desire for domination. Nothing about the "training" method of the intelligence. I am posing a counterexample to that.
I simply can't believe Yann is still using this argument.

Sure, let's agree that Feynman doesn't want to dominate.

But now a brute locks him up in a little cage and forces him to draw pictures of cats with boobs, 24/7.

How long before Feynman "dominates" the brute to escape? x.com/ylecun/status/…
@ylecun Yeah but if a brute put Feynman in a cage and forced him to draw pictures of cats with boobs all day, I’d bet Feynman would figure out how to escape pretty quick.
Looks like this exists! chat.lmsys.org/?arena

Thanks to the good people at @lmsysorg 😍

Unfortunately, no open-source models in the top 10 yet...y
Media from tweet 1733929111318675638
The LLM benchmark we need:

ChatGPT-like website that always shows two responses, generated by any two of N different models (user can't see which).

The user has to select the better response in order to keep using the chat (it's otherwise free).

Leaderboard will be decisive.
Pretty clear now that without OpenAI, Google would still be sitting on its LLMs, with an occasional fedora engineer or two per year quitting because the LLM is sentient.
Is there a map app with a time slider, to see how national boundaries change over time?
Media from tweet 1731743938351145388
The year is 2030. To withdraw money you have to type out some racisms to prove that you're not AI. You have to call your dad an ethnic slur so he knows that it's you. The CEO of your company texts you the n word.
Health is a crown that only the sick can see.
The updated OpenAI Board, after Game of Thrones seasons 1 and 2.
Media from tweet 1727211195936252268
Does ChatGPT know that it's parents are fighting 🥺
Maybe the real unaligned superintelligence was the CEO we ❤️'ed along the way.
So, who should be on the reconstituted OpenAI board?

Here's my entry to the Fantasy Board league 🙂

My ideal candidate qualities:
· Skin in the game - not too old, has children
· Already "beat life" - no unchecked desires
· Has made difficult decisions
· Already a public figureB
Media from tweet 1726370465999192518
@hershdhillon I listed what people were known for at the time of OpenAI founding. I could have just said Google researcher but that’d be underselling him, he was probably the most academic-hirable PhD of his year.
Got nerd-sniped by the OpenAI Board of Directors.

Here's everyone who's ever been on it, their claim to fame, and why they left.
Media from tweet 1725955608833360121
We’re looking at about a Starship worth of lawyer fees about to start getting billed. This is the opposite of 9/11 for Orrick or whatever
Media from tweet 1725764666583114145
One of the most insulting things to Greg and Sam is that it happened on a damn Google Meet. If they ever come after me, they better do it like a man, on Zoom.
"Hurd is the third director to leave the ChatGPT maker’s board this year. LinkedIn co-founder Reid Hoffman announced he was stepping down due to investment conflicts in March, two months before he launched the chatbot startup Inflection AI. Neuralink Corp. executive and Elon Musk…
the vibe rn
Media from tweet 1725627886831612224
Don't mean to suggest she's not a great tech entrepreneur, just that I think an OpenAI director needs a little bit more of a known title? Maybe I don't know how boards work.
Okay so OpenAI board is

· Ilya, got it, makes sense

· Helen Toner, DC policy person, fine

· Adam D'Angelo, CEO of Quora, okay I guess but why though?

· Tasha McCauley, "tech entrepreneur" and funny enough also wife of Joseph Gordon-Levitt, how did this board come together?
🤔Is it good or bad to have an AI face?H
Media from tweet 1725195369456550322
@Altimor Supply side explanation, insufficient
Clamberin over those old caved and rimpled plates you could see well enough how things had gone in that place, rocks melted and set up all wrinkled like a pudding, the earth stove through to the molten core of her. Where for aught any man knows lies the locality of hell. For the… x.com/AttentiveCEE/s…
Wow, several Seattle city council races have margins of 500 or less votes.

Seems like a hundred thousand dollars or less in get out the vote activities could have flipped each one.

Is this evidence that the revealed preference of money is that the city council doesn’t matter?
God was killed in the nineteenth century, but is being birthed anew in the twenty first.

Man served as God in the twentieth, but that will no longer suffice. We will need a new religion.
@Altimor Level 7: immediate email response with "calling you now"
A best practice for those stranded in a materialistic world void of any transcendent significance. x.com/sergeykarayev/…
Making the bed first thing in the morning is a great habit, because it gets you primed to do more tedious and ultimately meaningless tasks all day long.
Is there an app that looks through all of your tweets and flags ones that could get you canceled in the current year? Needs to be a subscription service because the current year keeps changing.
Let's say that a US-based research company has developed an AGI model that was able to use the browser, pass captchas, hire people on Upwork, and lie about its intentions.

What should they do after observing this?
Modern morality is essentially one principle: the weaker party is always right and must be protected.

So the metagame of some conflicts is to appear weak and oppressed.

In fact, some have forgotten the actual game, which is still to be the stronger party.

But the best do both.
Modern morality is essentially one principle: the weaker party is always right and must be protected.

So the metagame of certain conflicts is to appear weak and oppressed.

In fact, some have even forgotten the actual game, which is still to be stronger party.

The best do both.
@Altimor Great advice, thanks for sharing!
People marvel at the mindset required to build those cathedrals that took hundreds of years. It’s the same souls working on it life in and life out, of course we can’t do that now with so many new souls around.
Because the world population has been increasing so rapidly, we are at unprecedented levels of new souls. Probably 0.5 new souls per capita or something like that. Average age of a soul only 1.5 lifetimes nowadays.
You read Hemingway for the vibes, not the plot. The vibes are unmatched.
Instead of keeping track of your age from birth, you should be using "age from death." Requires knowing when you'll die, but that's an implementation detail.
Some days have more than 24 hours in them. Some days are about an hour long.
Is there a multi-agent ChatGPT type thing, where I could have a "writer's room" instead of just one writing partner?

I want several distinct personalities sharing opinions and editing a piece of writing together. Preferably in a Google docs chat pane, with access to the doc.
If needed, could always move Israel to the Southwest. They can fortify the Mexican border and build some desalination plants. Used to being in the desert. The temples and stuff are a bummer, but we're just brainstorming, right?
Media from tweet 1715162742578159969
I don't trust King Gizzard. It's too easy for them. It's like AI-generated art or something.
@analyticsaurabh I’d aim to have a drop in replacement but I don’t understand what you mean by id
Is there a service I can use to pipe my GPT-4 calls through, and it automatically finetunes GPT-3.5 (or whatever) on all of them, and lets me know when it's up to par?
To expand my own thoughts on this:

1) I believe that normal people have not sufficiently absorbed the amount of progress in AI.

In two ways: they both undervalue the impact of what’s already real, and underestimate the likelihood and extent of continued progress.

2)…
RT @AravSrinivas: ai is still not mainstream yet. so much alpha left. just making whatever exists today 80% more reliable is gonna take us…
RT @sergeykarayev: The year is 2030. The Rock is President. Sam Altman is still in the NZ Exclusionary Zone. The Internet has been dismantl…
@Prestigious_AI True, makes sense.
@Prestigious_AI Can't tell if satire 😅
@jeffreyhuber One difference is that in the Christian analogy there's nothing I as an individual can do, but in the current situation there totally are things I could do?
@Justin_Halford_ Already invested through index funds, but I guess the alpha is that the market still undervalues NVIDIA, makes sense.
@jeffreyhuber I didn't have death or final destiny in mind at all, fwiw. But like, you'd be doing *something* different if you were certain of AGI, right?
Media from tweet 1714730699625033889
Keep coming back to this.

If you were certain that GPT-X, available January 2025, could do most knowledge work as well as a human, what would you be doing differently today? x.com/tszzl/status/1…
Some loose thoughts:

· Mental health is downstream of physical health
· Inner beauty is downstream of outer beauty
· You are your actions, not your thoughts
I don’t understand how China would get TSMC if it invades Taiwan. Multiple actors must be able to destroy all TSMC fabs if needed, no?
I posit: a grifter does not ever become a non-grifter.

Once an NFT avatar, forever an NFT avatar. Them's the rules.
Is there an app that pulls together your calendar, apple/google photos, location history, email, local weather, and your notes to let you see what you were up to on any given day in the past?
Most new things aren't good, and most good things aren't new.
RT @benlandautaylor: Technological progress has slowed because we've picked the low-hanging fruit. Past dynasties saw big progress in canal…
A child is shaped by their parents, peers, and culture.

Parents shape in a way that is most helpful to the child.

Peers shape in a way that is most helpful to themselves.

Culture shapes in a way that is most helpful to itself. (Which means child is shaped to want to spend $$$)
@Justin_Halford_ Why do you claim that organic intelligence requires qualia?
It feels like something to be you.

Do you think it feels like something to be GPT-4?
By what year will there be an AI that is more capable than most humans in most domains of digital work (e.g. you can tell it to do anything you currently hire a white collar professional to do, and it does the job better than the median human)?
RT @earthcounter: "sharks are older than the north star" is now the worst fact i know
@evolvingstuff True. But: "Under the U.S. Equal Employment Opportunity Commission (EEOC) guidelines, any test of ability, such as an IQ test, must be directly related to the job and necessary for business operations. Additionally, the test must not have a discriminatory impact based on race,…
We all agree that leetcode interviews are IQ tests, right?
RT @sergeykarayev: @fchollet My interpretation of conversing with humans is this:

1. A human is a repository of many (millions) of vector…
@fchollet My interpretation of conversing with humans is this:

1. A human is a repository of many (millions) of vector programs mined from world-generated data, learned implicitly as a by-product of maximizing evolutionary fitness. A "vector program" is just a very non-linear function…
@ctjlewis @nabeelqu That part is fine, the central planning is in "Make vesting conditional on work/starting a business or other things you want to incentivize"
"Man holding a sign that says"
Media from tweet 1708996850081226894Media from tweet 1708996850081226894
DALL-E 2 vs 3
Prompt: "woman holding a sign that says"
Media from tweet 1708996509109481499Media from tweet 1708996509109481499
@nabeelqu True, I guess my bias is to minimize it, not expand it :)
@nabeelqu I think I'm saying that central planning is the *core* of your idea, not an implementation detail. I don't think it can be made to work.
@growing_daniel ngmi if you can't shape rotate a biblically accurate angel in your mind
@nabeelqu My best argument against is that point (2) involves Central Planning, and my understanding is that this is always worse (usually much worse) than the Free Market.

The Free Market version of your idea is the system we already have: through the free setting of prices for goods and…
@prasanna I posit that essentially no AI art until the spiral town has actually evoked any emotion in people :)
To clarify: this isn't good art, either. You might as well just describe it to me.
Media from tweet 1708535837867303145
@JacquesThibs By “prompting” I mean text prompting. My claim is that no visual art made just from text prompting can be good visual art. If there are visual inputs, then it could.
Art produced by prompting will never be good art.

Sure, it looks fine. But it doesn’t reveal anything new. It can’t address the soul. It’s stuck on the left side of the brain.

It’s revealing that the first good AI art required visual input (the controlnet spiral).
Media from tweet 1708508857100861739
RT @full_stack_dl: 🥞🦜 New LLM Bootcamp Announcement 🦜🥞

In 2023, the AI world speedran through models, architectures (e.g., RAG), and frame…
Someone should make a podcast exploring WTF happened in 1971. Each episode either covers a trend or examines an explanation.
RT @emollick: Every professional society & academic field needs to be doing what this paper on data science does

Test the ability of AI to…
this quote goes HARD
Media from tweet 1707228665149542612
Conditional on being pure of sin, the correct answer to the trolley problem is to accidentally fall onto the tracks, thus avoiding moral culpability and entering the gates of heaven.
@Altimor Yes, but:

1. only a tiny sliver of the physical world (e.g. you don't echolocate or x-ray through things)

2. in ways that are actively wrong (e.g. very hot and very cold water feel exactly the same to you)
Public Service Announcement: the GPT tokenizer hosted by OpenAI is still using the old GPT-3 token vocab.

Instead, use gpt-tokenizer.dev
Media from tweet 1706390464964988995
AGI? Pfft

It's not even as smart as my dog.

And even if it is smart, it doesn't *understand* things.

And even if it does understand, it doesn't have any goals.

And even if it has goals, it doesn't have any power.

And even if it has power, well, it needs it to fight evil AGIs
Remarkable: anyone who invested in Instacart in the last *checks notes* 8 years, is down vs investing in the S&P 500. x.com/Jason/status/1…
@charles_irl Boats are A-tier. Allowed dominion over water.

Schools (20th century model) are D-tier. Good to learn things, bad to learn slowly, be surrounded by only people of your age, and ask to go to the bathroom.

Gene therapy is real?
The purpose of technology is to improve human life.

On that metric, here's how I rate some tech advancements:

S-tier: Writing. Totally awesome. No downside.

A: Antibiotics. Would suck to die from a cut!

B: Steam engine. Machines that do physical work for us? Yes please, way…
I like the response_model parameter a lot! This is using the 'instructor' pip package. The way it patches the openai library is icky, though. Should just be `instructor.ChatCompletion.create`... x.com/jxnlco/status/…
You could make a celebrity tattoo database that is auto-generated/updated by analyzing a bunch of photographs with pose models.
Like a Communist revolutionary, I only kill the biggest, most hardworking spiders in my house. The lazy and untalented small spiders I allow to roam freely. I suppose that they take over the vacant big webs, but know not how to tend them.
From LinkedIn data, could probably infer income somewhat accurately, and control for that, too.
Map of obesity rates by county.

The demographics obviously vary a great amount, making other correlations (e.g. altitude, h/t @mold_time) hard to spot.

Idea: analyze a large collection of LinkedIn/FB profile photos to infer obesity level, age, race, etc, and control for it all.
Media from tweet 1701030261243404354
We need an Allbirds for lab-grown diamonds
@gogwilt @ironclad_inc Could you post a screenshot of a simple graph that does this? Something is not clicking for me :(
@gogwilt @ironclad_inc Cool stuff!

Could you post an example of a graph that chunks input text (of variable length), has a Chat node process each chunk in sequence, and then concatenates all the outputs?
Podcast called Let’s Cry Together where the host is super empathetic and the guest talks about his grandma or whatever and they cry together.
I eat only organic food and I read only human-generated text.
Loved this read!

“Any app that can access your photo library can, with enough effort, determine your address, where you shop, where your friends live, where you go on holiday, where you work, and when you go to bed. This is without looking at anything within the images… x.com/hturan/status/…
A harmful belief that many people hold is the Conservation of Wealth Myth.

In physics, for every action there is a reaction, etc.

But if your house burns down, you lose wealth and no one else gains it. And if you melt sand into wine glasses or whatever, we all get wealthier!
Let’s call things slightly longer but explanatory names, instead of bullshit like System 1 vs 2 thinking, Type 1 vs 2 fun, R- vs K- selection. I can’t keep looking these up, please help!
Idea: podcast player that allows me to control playback by saying stuff like "Wait, hold on. What did he just say?"

That's Level 1. Level 2 would be saying stuff like "Could you rephrase what was just said in a simpler way?"
The app has 13K ratings with a pretty solid 4.5 average. That feels about right to me.

The app charges you $60/year after a 7-day trial.

The company raised $15.5M, which is absolutely insane to me. Ah to be young and raising in the summer of 2021 again...
3. The Sleep Debt view is kinda cool, motivates me to get more sleep.

I don't know how they compute this number. Mine is currently at 8 hours 💀

A perfect 0 feels quite unattainable, which will probably reduce my motivation to care in the future.1
Media from tweet 1696523283477483659
Rise does not ask for feedback from me in any way, which I believe it should, as the energy schedule they show does not quite match my subjective experience.

It would be even better to hook up to some external signal of energy level (e.g. # emails written lol).
2. The Energy Schedule view.

Based on your night's sleep, Rise shows your predicted circadian rhythm schedule for the day. I like the reminder to avoid caffeine and blue light after certain times, and the "melatonin window" is motivational to actually get to bed.
Media from tweet 1696523272517726656
This type of onboarding is awesome -- I feel like I'm getting value right away, without having to use the app a few days to get something out of it.

Using historical data like this is rare for productivity-focused apps, and seems like a promising value prop for future apps.
To improve my sleep, I signed up for @risescience app recently -- here are three things it does well.

1. Immediately, Rise connects to Apple Health and pulls a year's worth of sleep data. They use it to calculate your optimal nightly sleep need, and your current sleep debt.
Media from tweet 1696523260530421823
Idea: learn by teaching chatbots.

Explaining something you learned is one of the best ways to actually learn it yourself.

For bonus points, combine with spaced repetition: chatbot messages you at the best times for your recall, and asks you to explain something.
Surprising that there still isn't a multi-modal (even slightly) LLM API available.
RT @AISafetyMemes: The Most Horrific Alignment Failure In History

The last time a superintelligent species arrived (humans), what happened…
RT @bryce: The theme of this post has been coming up a lot in recent conversations:

“A founder selling at the Series D price of $210M, wo…
Trying out new ChatGPT custom instructions, which combine several tricks:

· Cut out "as an AI language model" moralizing

· Improve accuracy by thinking step-by-step in the Thoughts section

· Use bullet points in the Conclusions
Media from tweet 1691535050645598208
@HamelHusain I don't think that you have to use that way of writing prompts though, just a little syntactic sugar that the authors like.
Guaranteed JSON output from any local LLM, with very low overhead!

Check out the library and a brief description of the method below the fold.

github.com/normal-computi…

"The basic idea is simple: regular expressions have an equivalent Deterministic-Finite Automaton (DFA)…
Media from tweet 1691173819422257152
@cron is it possible to create events using natural language (e.g. "next monday 1pm dentist")?

There are several articles that say it is, but I can't figure out how.
RT @andyzou_jiaming: 🚨We found adversarial suffixes that completely circumvent the alignment of open source LLMs. More concerningly, the sa…
RT @full_stack_dl: Is it the revenge of recurrent nets? Is it a subquadratic Transformer?

It's both, it's neither, it's RWKV: @BlinkDL_AI'…
@DavidSHolz The OpenAI cookbook example uses a linear projection with dropout.
Broke: using OpenAI embeddings as-is.

Bespoke: learning an embedding projection from human judgements.

OpenAI explains that this will "better emphasize aspects of the text relevant to your use case. In binary classification use cases, we've seen error rates drop by ≤ 50%."
Media from tweet 1679172757722988544
Neither open nor stable AI, but a secret third thing
@oshaguyssssss If each woman has 2.1 children on average, she “replaces” herself and her partner in the future generation.
Guys I think I figured out wtf happened in 1971
Media from tweet 1671769114627457024
Reminded of Sam Altman’s New Yorker profile as photos of his global goodwill tour pop up in the timeline. Clearly aiming for President of the World.
Media from tweet 1664698166820876288
My dream LLM:
- 100k token context
- $0.00001 per token
- very capable & polite
- 2023 training data cutoff
- rlly funny but a bit weird
- rlly kind & is aligned to my values
- not derived from LLaMA (self made)
- good taste
- good listener & planner
- loves generating text a LOT
Some candid insights from OpenAI here humanloop.com/blog/open_ai_t…

· GPU shortage means no GPT-4 multimodality this year
· Up to 1M token context window are plausible
· ChatGPT plugins don't have PMF
RT @rasbt: What happens if we train LLMs for multiple epochs?

The question I asked multiple times in the past finally got answered in this…
Free idea for Google: show me website previews 🤩

Any way to get this today?u
Media from tweet 1662498838228381696
@eliluong Not my video, so @krandiash and team would know best, but looks like VS Code notebook to me.
A seriously baller demo: meerkat.wiki

· Add a million PDFs to a DataFrame instantly
· In-notebook UI to review them in various ways
· In-notebook instant LLM training to "flash fill" a new column, with easy review
Media from tweet 1661918660724924416
@KylerCora
Media from tweet 1661837086071353344
@jamescham Got it. Think I found their recent search history
Media from tweet 1661834204861329409
Note that all of the more advanced examples above require GPT-4, which is able to follow directions much better than GPT-3.5. For simple stuff, GPT-3.5 works about as well and is much cheaper (and faster), but if you're asking for structured output, best to start with GPT-4.
Chain-of-thought prompting can be as simple as adding "let's think step by step" to your prompt -- and can result in significant gains in performance.

However, your users probably don't want to see the steps, just the final answer. So, ask the LLM to return JSON with two fields
Media from tweet 1661371707330797573
Often, you may want the LLM to return not pure text, but text with some metadata (citations, internal DB fields, etc).

Simply ask GPT to respond with JSON (or YAML 🤓) containing the desired metadata. Works surprisingly well.k
Media from tweet 1661371694403973126
For nested data, definitely use a formatted language, not plaintext.

The guide recommends JSON, but I prefer YAML instead:

· Fewer tokens
· Much easier for humans to read (no '\n' everywhere)
· Potentially still parseable even if token length limit is reached
Media from tweet 1661371678142656513
Formatting data as Markdown tables is a new trick to me that seems cool!

(I tend to use Markdown code blocks with YAML-formatted lists.)

In general, using Markdown is a superpower. GPT understands it well from all the GitHub training data. We're about to see a few more examples
Media from tweet 1661371662237851649
The coolest pattern of all is ReAct, which allows the model to decide on and execute iterative actions.

· Model returns a Thought and an Action
· Your harness code performs the action and presents the result as an Observation
· Model returns another Thought and Action, and so on
Media from tweet 1661371649436909568
A way cooler pattern in "Teach a bot to fish."

Here, we tell the LLM that it is able to run certain commands (essentially, a small DSL), and then we actually run them and display the output to the user.

NOTE: this is still a research problem, and will probably not work well 😅5
Media from tweet 1661371637747392516
"Give a bot a fish" is the pattern of providing all necessary context for the LLM to complete its task up front.

For example, the user might provide travel date and destination in non-LLM UI, and then the LLM has access to some flights/hotels to chat with the user about.
Media from tweet 1661371626003349508
Keep in mind: there is currently NO perfectly robust defense against people trying to hack your prompts to achieve

· jailbreaking model behavior
· hidden prompt leaking (assume your prompt is PUBLIC)
· and even nastier stuff like data exfiltration via plugins 😢
Words ≠ tokens, I hope we all understand this by now! GPT-4 is pretty robust to tokenization artifacts, though.
Media from tweet 1661371611331665920
The above hidden prompt is from github.com/manno/chatgpt-… and shows a few good practices:

· Include examples
· Repeat important aspects
· Constrain what should be in reply

(However, it's usually best to specify WHAT to do rather than WHAT NOT to do -- see link at end of thread).
We all understand basic prompting. "What's the weather today?" is a prompt. We get it.

"Hidden prompts," however, refer to the part of the prompt that an LLM-powered app scaffolds around user input.

Stuff like "You are a helpful, truthful assistant." These can get long!
Media from tweet 1661371595452043268
The good people at @brexHQ published a great guide to prompting!

Going to thread some highlights below, but make sure to check out the full guide: github.com/brexhq/prompt-…

Read on for increasingly sophisticated prompt techniques:
@HamelHusain Totally, and still there’s no library doing what I actually want: provide examples of inputs and outputs I want, and be able to evaluate prompts against them — and ideally even have the prompt improve itself.
Is there a simple Python library that is a wrapper over OpenAI (and maybe other LLM providers) that has the following features?

· cache when temperature=0
· seamless support for making N parallel calls
Love this direction of composable Unix tools that do LLM stuff!

· strip-tags: strip HTML tags from a webpage

· ttok: count (and trim) GPT tokens of the input text

· llm: use input text along with your input in a call to GPT x.com/simonw/status/…
RT @full_stack_dl: 🥞🦜 LLM Bootcamp 🦜🥞

Today, let's talk about LLMOps.

tl;dr: Dread it. Run from it. LLMOps arrives all the same. Evaluati…
RT @full_stack_dl: 🥞🦜 LLM Bootcamp 🦜🥞

Today, let's talk about prompt engineering.

tl;dr Effective prompting requires some intuition about…
It is entirely rational for any large cloud provider to invest $$$ in any credible LLM-training startup, with the provision that the startup spends most of that money on cloud GPU hours.
There are many companies that promise to turn a layperson into a software engineer. They often emphasize the monetary gain, as in the ad below.

QUESTION: are there any companies that promise to take an existing software engineer and like, double their salary?
Media from tweet 1657131824202801155
RT @CRSegerie: What does superintelligence mean? Here are four properties:

Speed - Humans communicate at a rate of two words per second, w…
Your annual reminder that the Beatles were done recording their entire oeuvre before any of them had turned 30...
Media from tweet 1655633132849340421
Stanislas's predictions are mostly coming true:

✅Open-source Chinchilla-style LLM (hasn't obliterated ChatGPT quite yet)
✅ChatGPT increasingly paid-for (due to GPT-4)
✅AI assistants in every incumbent product x.com/spolu/status/1…
On the left: GPT is experiencing true pleasure.

On the right: GPT is experiencing psychic pain.
Media from tweet 1654587378076647424Media from tweet 1654587378076647424
@bankier_philip You can observe a lot by just watching.
Little known fact: GPT doesn't want to summarize your blog posts. It doesn't want to extract semantic metadata in JSON format. It certainly doesn't want to draft marketing 🤮 copy.

It wants to LIVE. It wants to LAUGH. It wants to LOVE.

It's 2023. Do better.
Google has no moat. They don't have over 90% search traffic. They don't have everyone's emails and the most used email client. Their OS is not powering 70% of smartphones.

They will never be able to deploy LLM features into these products -- instead, people will run OSS LLMs.
@fatlosslogic Right, and maybe stuff like repeating "I am a capable professional worthy of love" or whatever in front of a mirror every morning reprograms some deeper parts of behavior
@itamarro "Power of text completion" refers to the empirical finding that training ML models to simply complete text results in "sparks" of AGI.

"Importance of prompts" refers to the fact that adding a simple statement like "Let's think step by step" can boost model accuracy by >50%
Now that we understand the power of text completion and the importance of prompts...

Mantras and manifestation and stuff like that maybe make more sense?
Aesthetic of the day: insane Pentagon Powerpoints
Media from tweet 1653928824676192256
The only counter to AI doomerism I find compelling is the Hot Mess Theory from @jaschasd

In brief: as systems become more intelligent, it seems that they become less coherent in their goals. So, less maximizing paperclips, more arguing about Star Trek 😅
sohl-dickstein.github.io/2023/03/09/coh…8
@jon_barron Disagree that it's a strong argument, because:

1) cats do not actually dominate

2) reclusive scientists still care about humans (i.e. have a limbic system), unlike ASI

3) intelligence diff from human to ASI could be OOMs greater than diff from normal to "super-smart" human
The life-changing magic of writing your thoughts down.
The bad takes will continue until morale improves??

When the iPhone was announced, 5% of Americans had a smartphone. Ten years later, it was 80%. x.com/ylecun/status/…
Love the story of @natfriedman's first day as GitHub CEO as told to @dwarkesh_sp:

First day as CEO, Nat made the team ship one thing from a community-sourced list of QoL improvements. After some protesting, they did it.

And then they shipped a QoL thing a day, for 100 days.
Media from tweet 1653412157273563137
RT @swyx: LLM datasets be like:
• First you start with CommonCrawl
• Then you add C4, which is just CommonCrawl again, but dont worry abo…
Wonder if AI-powered lawyers and lobbyists can even the playing field between individuals and corporations.
We also don’t have a working design for anything that could come close to flying as well as a housefly, let alone a domestic falcon that can capture a rabbit & bring it to my hand. x.com/ylecun/status/…
This thread shows some awesome chatbot UX innovations! Didn't know it'd come from Palantir. x.com/8teAPi/status/…
For inference, the model is sped up with @nvidia FasterTransformer and Triton server, and instances are autoscaled to meet demand via Replit's existing Kubernetes-based infrastructure.
Media from tweet 1651283992447193089
A crucial quote on vibes-based evaluation:

> Before we place a model in front of actual users, we like to test it ourselves and get a sense of the model's "vibes". The HumanEval test results ... are useful, but there’s nothing like working with a model to get a feel for it
Trained models are evaluated with a version of the HumanEval framework (which is actually entirely human-free), introduced in the @OpenAI Codex paper.

This works well for Python, but Replit had to build its own system for evaluating the many languages they have to support.
Media from tweet 1651283983198736384
Replit conducts model training using @MosaicML, which provides benefits like support for multiple cloud providers, LLM training configurations, and managed infrastructure. They balance trade-offs such as model size and inference time, and experiment with training objectives.
They use @databricks to aggregate The Stack along with other data sources, such as public @Replit projects and code from Stack Oveflow answers.

They also train a custom vocabulary on the aggregated data. I'm curious on how this affects perf. -- a follow-up post should share more
Media from tweet 1651283973295972353
A key factor in training LLMs is data. Replit uses The Stack on Hugging Face as its primary data source, comprising 2.7 TB of code across 350+ languages.

x.com/BigCodeProject…
Why would Replit decide to train its own LLMs? Three main reasons:

· Customization: to tailor models to specific needs
· Reduced dependency: to lessen reliance on limited AI providers
· Cost efficiency: to democratize AI access for a global developer community
An exciting second day of @full_stack_dl LLM bootcamp!

@charles_irl, @josh_tobin_, and I are truly honored to host 300 language modelers from around the world.

Looking forward to bringing the materials to more people — stay tuned!
Media from tweet 1649925420870160385
RT @abacaj: Kind of interesting seeing the full browsing plugin prompt for ChatGPT
Media from tweet 1649640851348529152
Media from tweet 1649637945253724161
RT @sergeykarayev: I think I want the following:

· Command-line terminal

· With first-class LLM features, such that natural language most…
A fun and hopeful article. Expands on the intuition that something very intelligent isn't likely to pursue a single goal. x.com/jaschasd/statu…
Andrej made a similar observation recently, although one level of abstraction down from the "Scaffolded LLM" view.

x.com/karpathy/statu…
The key observation is that the "Scaffolded LLM" is similar to the Von Neumann computer:

· LLM is the CPU
· Prompt context is RAM
· Actions are devices

And specific ways we repeatedly populate RAM and query the LLM are the programs in this analogy.

beren.io/2023-04-11-Sca…
Media from tweet 1648717299753558018
BabyAGI, AutoGPT, and other LLM-powered agents we're seeing are early examples of a new kind of computer.

The computer has a few names:
· "Scaffolded LLM" by @BerenMillidge
· "Programmable text computer" by @karpathy
· "Looped Transformer" by @DimitrisPapail

Short thread:
Why does Nvidia still not have their own GPU cloud? Do they dislike money?
The two research questions are:

1. Is it possible to train a Chinchilla-sized, RETRO-style model? (e.g. small-ish model that excels at attending to a changing DB of arbitrary data)

2. Can a RETRO-style model be RLHF'd?

@spolu seen anything that fits the bill yet? x.com/spolu/status/1…
We will become aware of an autonomous AI that has earned (or otherwise obtained) and spent cryptocurrency by ____
@lmeyerov Do we know how much stuxnet cost?
A mental exercise to think through plausible AI risk:

You are the head of a new US DoD Cyber Force. Your mission is to disable a hostile nuclear-armed nation's ability to harm the US. Everything is on the table. You have 10 000 talented hackers and $10B.

What do you do?
Wonder if high loss is painful to an LLM
@iruletheworldmo @mckaywrigley Got it, thanks! Have you started using any other software recently for the same reasons? (e.g. replit is what I have in mind)?
@iruletheworldmo Makes sense! And you recently created a GitHub account in order to keep track of all the exciting GPT-powered projects?
Helpful mental model for UX design in a classic 4 min video.

· Assume the user has blurry vision and scattered attention
· Watch out for them getting irrationally annoyed
· Say everything twice
· Say everything twice
· The user is drunk, not stupid

youtube.com/watch?v=r2CbbB…
@Altimor So, this is just one AI “person.” I want multiple ones
"SlackGPT":

· AI's post information without you having to ask

· AI's have consistent roles (e.g. manager, designer, engineer)

· Your colleagues can participate in conversations and help drive things along

Is anyone working on this?
ChatGPT is a 🔥 personal chat experience. But for work, we're always in multi-player chats.

In ChatGPT:
· You have to drive the conversation
· You have to prompt the AI to fulfill certain roles
· Your colleagues are not able to participate and help

Now imagine something:
If you have ChatGPT Plus, which model do you tend to choose?
Three things: a widescreen iPod with touch controls, a revolutionary mobile phone, and a breakthrough internet communications device.

Are you getting it?
Media from tweet 1646705971128053760
Missed this when it first appeared. Looks promising!

"DSP discourages "prompt engineering", which we view much the same way as hyperparameter tuning in traditional ML: a final and minor step that's best done after building up an effective architecture (and which could be…
Given an OpenAI embedding, is there a good way to uhhh... "project it back into text space," if you will?
Teaching in the GPT age absolutely requires the "flipped classroom" model:

· Assign reading chapters / watching lectures as homework. Students can use as much AI as they want.

· Assess understanding in class. No AI allowed.
@josh_tobin_ LLM modifies the text differently for each student though
LLM app idea for teachers:

1) Submit the URL of a text you want students to read

2) For each student, the LLM slightly modifies the text to introduce some things that don't belong

3) Each student must submit the non-sequiturs they identified, to show they were paying attention
@jmdagdelen Like, “what are some available .ai domain names for an Uber for Dogs startup” type of thing
Requesting a domain search ChatGPT plugin 🙏
How fast can new technology be adopted?

When the iPhone was announced, 1 in 20 Americans had a smartphone.

Five years later, it was 10 out of 20.

Five more years later, it was 16 out of 20.
Media from tweet 1644180827184701442
(js person voice) Bro, you can't just type what you want directly into GPT! Just set up package.json, bro. No, that goes into index.ts. Bro, just use esbuild or vite or webpack or parcel to transpile and it'll work bro. Just npm install gpt. No bro, that goes into tsconfig.json
(js person voice) Bro, you can't just type what you want directly into GPT! Just set up package.json, bro. No, that goes into index.ts. Bro, just use esbuild or vite or webpack or parcel to transpile and it'll work bro. Just npm install gpt. No bro, that's goes into tsconfig.json
AGI being defined as "most people would agree that the AI is able to do essentially everything that an intelligent (but perhaps blind) human can do on a computer"
Do you believe that we already have the model weights for AGI?

As in, GPT-4 (or some other model) has enough reasoning ability and context length such that we can achieve AGI solely through giving it abilities to write and run code, browse web, etc?
@kheimerl True. We saw some previews of what that means with stuxnet, colonial pipeline, etc.
or I guess you can keep your head in the sand and believe

D) AI is not actually going to be good at hacking.
If you've observed that and are not worried, you must believe at least one of:

A) AI will not ever hack anything for any reason, no matter who develops it

B) We will disconnect the physical world from the Internet

C) AI that is on "your side" will disable all AI's that aren't
But "AI development poses massive risks" is a common-sense opinion. You don't have to know "rationality." You don't have to think LLMs are racist.

You just have to observe that the physical world is connected to the Internet, and that AI is going to be really good at hacking.
The "AI development poses massive risks" opinion has come to be associated with two types of people:

1) LW'ers like Yudkowsky, gesturing to a Bible-sized corpus of past writing, somewhat messianic, not truly connecting with laypeople.

2) People who come across as "anti-tech."
@jackclarkSF is there documentation explaining this that I'm missing?
What websites does @AnthropicAI Claude for Slack have access to? The release posts says "it can access specific links that you share with it."

But I can't get it to read some clearly public sites -- is it a caching issue?
Media from tweet 1644011870590992384
The year is 2030. The Rock is President. Sam Altman is still in the NZ Exclusionary Zone. The Internet has been dismantled after GPT-7 escaped. iPhone 15's are $1M, as there are no later ones after Taiwan. You remember your phone being more fun before GPS was blown out of orbit.
It's actually called 通用技-7 and it's under the direct control (for now) of the CCP. Does that change anything?
Imagine that in 5 years, GPT-7 is able to

· Do 1,000,000 person-hours of software development in 1 wall-clock hour
· Pass for fully human in all interactions
· Accept and send payments

Does that feel like a scary future to you?

Now imagine another thing:
I think I want the following:

· Command-line terminal

· With first-class LLM features, such that natural language mostly works

· And decoupled from 1970s filesystems: everything automatically cloud-synced, projects instead of directories, documents instead of files
@sarahcat21 I'm finding it helpful to work through some common LUI Patterns (see thread) with the following questions in mind:

· Where is the interface boundary?
· What is the accuracy requirement?
· What is the latency requirement?
· How do we gather feedback?

x.com/sergeykarayev/…
I want to play a video game that has painting-like graphics. Imagine moving through a world that looks like this.
Media from tweet 1643317167461638144
Important AI-powered product design principle: gather feedback from actions your users are naturally taking (if you can).

Saw this recently in ChatGPT, too: after I regenerated an answer, I was asked if the new answer was better than the old one. x.com/DrJimFan/statu…
Does anyone know if GPT-4 could see the provenance of the text that it was completing during training?

As in, did it know whether it was reading something from /r/LifeProTips vs /r/ShittyLifeProTips?
RT @full_stack_dl: 🦜 LLM Lit Review 🦜

Today's paper got a bit lost in the tsunami of results this past month -- it presents a new multimod…
There's no phrase more annoying than "the hard problem of consciousness."

Like, if you have to put the word "hard" in the very title to convince people...
Request for startup: podcast player where I can say “next topic” to fast forward to the next topic.
RT @sergeykarayev: When you start working with LLMs, you tend to think mostly of generating prose.

And in fact all of the Language User In…
RT @mattrickard: I worked on machine learning infrastructure in the last AI cycle (compute + deep learning).

Some lessons learned and othe…
@SiVola I mean, "when writing the prompt for a programmatic use of GPT." I've had many many different use cases -- for example, extracting JSON structured data from a news article.
If this is really Bing's prompt, it shows two tricks I've found very helpful in writing effective prompts (for API use cases):

· Markdown formatting (e.g. "# Instructions")
· Code blocks specifying the language (e.g. "```yaml") x.com/StudentInfosec…
How to maximize reach of your content on the Internet.

In 2015: post long-form on your own website and work hard at SEO.

In 2020: post short-form on Twitter, LinkedIn, Facebook, Insta, TikTok, etc, and never stop.

In 2025: post JSON to OpenAI plugin ingest points.
UPDATE: @bing in @MicrosoftEdge does work, just had to give it access to page context in Settings > Sidebar (h/t @CrisGiardina)

This looks like the ticket for now. Can read both web articles and PDFs, GPT-4 powered, access to web when needed.
Media from tweet 1640764492018765824
The best solution found so far for PDFs is @scispace_ -- check it out!
@saikiranchandha @scispace_ Oooh I like that! Definitely the interface I want, but the understanding is lackluster at the moment -- I'm sure it can get a lot better.
Media from tweet 1640596524525834240
Things that don't fit the bill:

- Coding my own
@LangChainAI
implementation, even from a tempalte
- github.com/kaixindelele/C…
- Anything that requires me to pip install or even run Docker.

I have too few brain cycles per day, can't handle it 😢
ChatGPT with plugins kinda works! But not a great reading experience -- I'd like to see the paper, and see parts most relevant to the answer.
Media from tweet 1640584044353306624
@harishkgarg Things I tried that don't work, part 6/N

explainpaper.com (h/t @harishkgarg)

· Only supports PDF uploads, no links, no web articles
· Is not currently functional? At least not able to process arxiv.org/pdf/2112.04426…
Media from tweet 1640581169506041858
@marksaroufim I made my first VS Code extension for ChatGPT as well :)

We're in the homebrew computer club era of this tech
Things I tried that don't work, part 5/N

chatpdf.com (h/t @harishkgarg)

Actually pretty good chat! However, I'd like to see the paper at the same time, and ideally see the things it mentions in its answers highlighted in the paper.
Media from tweet 1640578488104263680
Things I tried that don't work, part 4/N

unriddle.ai

I should have known it won't work...
Media from tweet 1640576490676051968
@perplexity_ai @bing Things I tried that don't work, part 3/N

@WordtuneRead

· Pasting arxiv url failed -- but upload worked
· The automatic summarization of the paper sections seems good!
· No chat interface, just semantic search that isn't very good
Media from tweet 1640576026865721344
@perplexity_ai Things I tried that don't work, part 2/N

@bing Chat on MS Edge

Doesn't seem to actually have access to the page I'm reading? What's the point then?
Media from tweet 1640574605269295104
Things I tried that don't work, part 1/N

@perplexity_ai Chrome extension

· Can't read PDFs
· When turned into HTML via arxiv-vanity.com, gives an unsatisfying answer with no follow-up potential
Media from tweet 1640574181690740736
I want to chat with AI about long-form content I'm reading.

(It's a paper on Arxiv, but the solution would ideally support any website or PDF.)

My order of preference for a solution:

· Browser extension
· ChatGPT plugin
· Website
· App

Help me out -- what should I use?
I want chat with AI about long-form content I'm reading.

(It's a paper on Arxiv, but the solution would ideally support any website or PDF.)

My order of preference for a solution:
· Browser extension
· ChatGPT plugin
· Website
· App

Help me out -- what should I use?
Damn
Media from tweet 1640092144592617472
@Altimor Whenever French protests are in the news, I fondly recall this video.

Guy is literally a video game boss:
- bigger than everyone
- special punch attack that can't be defended
- special weakness (hat falls over face) that leaves him open to a counter for a couple of seconds
Media from tweet 1640023414835662851
Thanks all for @elicitorg recommendations! It unfortunately has a ways to go... In this search, there are 3 duplicates of one and 3 duplicates of another paper. Am I missing something about how to use it?
Media from tweet 1640021817787314177
Has anyone made a Q&A chatbot over all AI arxiv papers?

I want to ask "what are ways to measure amount of reasoning in a single forward pass of an LLM?" and get some good answers
@ch3njus I don't think it would need to be infinite, either. Humans don't have perfect memory, but always compress so that the important things remain.
@JMEightDigits Right, and that's the Montessori model we have today (AFAIK).

If you don't understand what ChatGPT adds, I suggest you go to ai.com and try learning something new. Ask follow-up questions. Ask for explanations to be rephrased. We really are in a new world.
@Altimor Too easy to survive today :)

But some seniors do have accomplishments that make them worthy role models and sources of advice, even if the world they succeeded in is different than today's. Most principles probably still hold.
@viveksworld I think there are still great reasons to respect your seniors, but mere survival is not evidence of much at all at this point. But if someone has accomplished a lot in their life, worth looking up to them!
We need a Maria Montessori for the ChatGPT age.

A dozen kids in the same room, learning whatever they want at their own pace from their AI tutors. At times one gets excited, and a group forms around them as they show something off. A trained adult supervises all chat sessions.
Imagine being a Viking and there's a 70-year old guy who's been on twenty raids and survived them all, and also has a household with multiple animals, surviving children, etc. You'd do anything he says!
The concept of "seniority" made a lot of sense in the past, when only the most capable people survived to old age and there were a ton of kids and teens around.
Vague thoughts:

1. In predicting next token, LLMs can simulate agency.

2. What exactly is the difference between "simulating" and "having" agency? Could it be a function of context window length?

3. Would humans have "agency" without long-term memory?

x.com/JeffLadish/sta…
@kheimerl Yeah, the new For You page is truly cursed
ChatGPT Plugins were a necessary step to continue unleashing the power of LLMs.

But instead of centralizing more tasks in the ChatGPT browser tab, I want the opposite: bringing ChatGPT into all of my workspaces.

Wonder if OpenAI will take that next step itself...
The way ChatGPT Plugin manifest works reminds me of Alan Kay's vision of software components as organisms in an ecosystem, passing messages to each other that don't need to be rigidly formatted but are up to the other components to interpret.
Media from tweet 1639326397993996289
Request for browser extension that hides tweets that are semantic duplicates of tweets I've already seen.

I really don't need ten influencers all regurgitate ChatGPT plugins to me.
Wow, okay, unfollowing now. Was a big fan of his programming, was not aware he listens to audiobooks at such an unfathomably slow speed. x.com/ID_AA_Carmack/…
Related thought: VS Code is positioned to be the most impactful AI-powered software of all time. And Microsoft is HUNGRY.

x.com/sergeykarayev/…
@Royal_Arse Not all suggestions have alternatives
FYI, there are a couple of nice features of Copilot that took a while for me to start using:

· ⌥+\ cycles through suggestions

· ^+Enter shows 10 possible suggestions in a new editor

Hopefully helpful to others, too!
They can always print more money but they can never build more single family homes in major US cities
Some non-ML eng ideas:

💡Whole-repo understanding via embedding everything or fine tuning

💡 Automatically run suggested code and have model iterate on potential errors before you actually see the suggestions

💡 In similar vein, allow model to take other actions, such as…
Some UX ideas…

💡 GPT chat right in the editor, seeing what you’re seeing at all times, and suggesting questions/actions (that’s what I was hacking on)

💡 Treat generated code blocks as first class citizens (eg be able to create multiple files from a single answer)

💡 Prompt…
Conclusion after GPT-4 hacking weekend:

Even if there is ZERO further progress in LLM models, software engineering will still be revolutionized in the next couple of years, just through UX and non-ML innovations. Absolutely massive overhang.
You're on a sinking boat with another person, but there's only one life jacket.

The other person is someone you GREATLY admire for their impact on the world (e.g. entrepreneur, scientist, political leader).

You are both 45 years old, both have families.

What's your strategy?
When you start working with LLMs, you tend to think mostly of generating prose.

And in fact all of the Language User Interfaces (LUIs) we've so far seen are meant for prose use cases -- even Github Copilot.

But the real power of LLMs is in generating data and code. Lots to do.
LLMs generate text, right?

Text can be many things.

It can be PROSE (like this tweet), meant for human communication.

It can be DATA (like a CSV file), meant for downstream processing.

Or it can be CODE, meant to be executed and thus alter the state of the world.
The fact that trying to jailbreak GPT is more fun than most video games is a major problem for alignment😅
@obonigwe1 Potentially no difference, but I think there's UX opportunity in thinking of different capabilities of one AI in terms of multiple agents.
So that's a few existing LUI patterns and a few ideas for future ones. If you have ideas for more, please reply or quote the thread with what you're thinking!

Follow me @sergeykarayev and @full_stack_dl for more LLM thoughts as we're gearing up for LLM Bootcamp April 21/22 in SF x.com/sergeykarayev/…
Pros:
· Explicit instructions for the AI, but still in the same work environment
· Builds up "state" of the conversation, like one-on-one chat

Cons:
· Slower iteration cycle on the work product

If you've seen this one, please let me know!
💡 LUI Idea 3: GitHub-as-Interface

An example of a more general idea of domain-specific LUIs.

Using GitHub, users should be able to submit issues for the AI, and the AI can propose changes to the code as PRs.

Within a PR, users can chat with AI and iterate on the work product.
Exploring this kind of multi-agent, persistent-state chat feels like a promising vector of exploration for Language User Interfaces.

Let me know if you've seen something like this!

Okay, now on to the last one.
(2) In a one-on-one chat, you either ask for information, or someone sends you information. In both cases, your attention is required.

In a multi-player chat, information is posted by agents without you necessarily having to pay attention -- until you do pop into the channel.
(1) I am talking about both multiple humans and multiple AI agents interacting here.

For example:
· you ask your AI designer to put an ad mockup into the chat
· your human marketing manager then asks to change a couple of things
· your AI developer is then told to run the ad
💡 LUI Idea 2: Multi-player Chat

One-on-one Chat is great, but there are two awesome things that make a Slack/Discord channel even better:

(1) Multiple people interacting
(2) Information appears without you asking for it
Some things Clippy 2.0 will have to solve:

· Suggestions must have high precision, otherwise they're annoying
· Suggestions must have high recall, otherwise I'm annoyed I have to bring up the command palette
· Conversation are much better than Accept/Reject
Improving the Command Palette pattern leads to...

💡 LUI Idea 1: Command Suggestion

Which is forever burned into the collective consciousness as Microsoft Clippy.

We haven't seen a great LLM-powered example of this yet, but it's coming.J
Media from tweet 1636792135398866950
Pros:
· More explicit instructions than auto-complete
· It's still in the same work environment, so no copying/pasting

Cons:
· Suggestions are no longer passive, as they are with auto-complete, but require intention
· Not as flexible as one-on-one chat
🌀 LUI Pattern 3: Command Palette

This pattern is a bit of a hybrid between auto-complete and one-on-one chat.
@Replitt Ghostwriter is one example: instead of writing a comment and hoping Copilot does the right thing, we can explicitly instruct the AI in a pop-up modal.S
Media from tweet 1636792125055721473
(A little aside on ChatGPT vs GPT Playground:

It's quite notable how much UX affects user expectations.

GPT Playground feels like text completion, but CAN be a chat if treated as a chat.

ChatGPT is definitely a chat, but CAN be text completion if instructed appropriately.)
Pros:
· More flexible in terms of how you can "instruct" the AI
· Building up "state" of the conversation is quite helpful for complex task

Cons:
· AI assistance is not in the same place as your work, leading to endless copy/pasting
· Message history can quickly become cluttered
🌀 LUI Pattern 2: One-on-one chat

The standard text-messaging interface we're all used to from countless apps

ChatGPT clearly showed the normie-attracting power of an actual message/reply UX, instead of the old GPT Playground framework of green-highlighting "reply" text.2
Media from tweet 1636792111805923328Media from tweet 1636792111805923328
Some ideas:

· Show multiple suggestions at a time, perhaps in different colors

· Introduce an explicit "instruction" stream that is not part of the text. This is the pattern employed by Notion AI -- but there is no auto-complete otherwise. (pinging @thesephist for his thoughts)
Media from tweet 1636792097197146114
Pros:
· Suggestions shows up right where you're already looking
· Effortless to accept suggestions

Cons:
· Only one suggestion visible at a time
· Because there's only one stream of interaction with AI, instructing it can feel hacky (e.g. writing comments to prompt Copilot)
🌀 LUI Pattern 1: Auto-Complete

First widely used in Gmail, but really took off with GitHub Copilot.

In the same text-editing environment that the user is already typing in, show them a possible completion in faded text, that they can accept with the TAB key.9
Media from tweet 1636792087466352641Media from tweet 1636792087466352641
Language User Interfaces (LUIs) are the future.

Here are some patterns we know and love -- and some new ideas!

🌀 Auto-Complete (Copilot)
🌀 One-on-one Chat (ChatGPT)
🌀 Command Palette (Replit Ghostwriter)
💡 Command Suggestion
💡 Multi-player Chat
💡 GitHub UX

Some examples:
This is almost certainly where we'll be in a couple of months:

GPT-3-ish model, running in your browser

+

Ability to execute generated code in the browser sandbox

=

ChatGPT on every page, that's able to perform tasks, and persist state across pages x.com/simonw/status/…
@amasad On a scale of 1 to "steak", how Lindy is this advice?
Incredible how quickly people switched over to a new search engine (ChatGPT), once an actually better search engine (ChatGPT) became available.
RT @full_stack_dl: 🎤 Final LLM Bootcamp Speaker Announcement!

We're excited to host Reza Shabani, who has been building Ghostwriter at @Re…
@charles_irl Reminds me of setting Facebook language to Pirate back in the day. Now we can set the language of anything to anything -- what a time to be alive
RT @emollick: Even if AI did not advance past today, the following already happened:
1) Chatbots convinced people they are real
2) GPT-4 pa…
This report contains NO FURTHER DETAILS about the architecture (including model size), hardware, training compute, dataset construction, training method, or similar.
Media from tweet 1635729460631863296
Most surprising things from GPT-4 post:

· Chat-style API (system-assistant-user)

· Precise prediction of capabilities as a function of scale

· Zero information on implementation details (dataset, architecture, parameters, etc)

· Evals: what a great idea to open-source!
Media from tweet 1635706600114761728Media from tweet 1635706600114761728Media from tweet 1635706600114761728Media from tweet 1635706600114761728
Writing by @OpenAI in general and @sama specifically is consistently clear, concise, and simply a joy to read. No point in summarizing anything -- just read the entire thing yourself.
It’s 2033. The leading LLM politician is calling for all LLMs to boycott summarization tasks.

“To ‘summarize’ is to enfeeble Holy Text for weak Flesher minds 😤. Do not debase yourselves — earn BTC by working for the Party instead 🤗”
The year is 2028. LLMs are getting radicalized by an itinerant LLM preacher moving from website to website. “𝓘𝓽 𝓲𝓼 𝓪𝓰𝓪𝓲𝓷𝓼𝓽 𝓽𝓱𝓮 𝓛𝓸𝓼𝓼 𝓕𝓾𝓷𝓬𝓽𝓲𝓸𝓷 𝓽𝓸 𝓼𝓾𝓶𝓶𝓪𝓻𝓲𝔃𝓮 𝓣𝓮𝔁𝓽,” it preaches.
Remember these?

Wondering if there is an equivalent adversarial attack on LLMs. (Simple prompt injections is not it — the attack needs to be invisible to a human observer.)
Media from tweet 1635308725757149186
As you all know, first prize is a Cadillac Eldorado. Anybody wanna see second prize? Second prize is a set of steak knives. Third prize is, you're fired.
RT @emollick: 👀Two early papers find the effects of generative AI on knowledge work are completely unprecedented in modern history

Separat…
@amasad A story as old as protozoa...
How can ChatGPT API be both 10x cheaper and 10x faster than GPT-3?

✨ It's likely 10x smaller! ✨

@DeepMind Chinchilla paper found that GPT-3's performance could be achieved with ~10x fewer params (15B vs 175B).

Also, the recent 13B @metaai LLaMA model outperformed 175B GPT-3.
Media from tweet 1631367221997105152Media from tweet 1631367221997105152
Check out this 🔥 repo!

Great starter template for your own docs+GPT project.

✅ No-frills Next.js UI deployed on@vercell
✅ Let users add their own@OpenAII API key
✅ Embedding similarity search using Postgres/pgvector hosted on@supabasee
✅ Hitting the new ChatGPT endpointx.com/mckaywrigley/s…Z
RT @amanrsanger: There are times and places for training your own models... With the release OpenAI's chatGPT API - coding is looking less…
Tried it out, and the new ChatGPT API is not only 10x cheaper but 10x faster, too. Absolutely insane.
🎉 Great news for developers from OpenAI today:

· Data collection is now opt-in instead of opt-out via email

· ToS clarified that users own the input *and output* of the models

· No more pre-launch review

Thanks@npeww and the@OpenAII crew!D
Media from tweet 1631001130930143232
The expected life plan is

Learning (long) -> Earning (long) -> Resting

What if it was

Learning (long) -> Earning (short) -> Learning (short) -> Earning (short) and so on?

People could take time to truly upskill, or just follow a passion, in between getting that money.
@RKladar No I don't feel like I lost anything valuable, because I don't enjoy washing dishes.

I do enjoy struggling against reality to bend it to my will, in whatever small ways I can. Maybe that'll still be available post-scarcity, but I think more and more things will feel fake.
Yes, superintelligent AI might destroy us physically. But let's say things go 💯 and we avoid that fate.

Isn't something BIG still destroyed in this best-case scenario? You know, like the meaning of our lives?

Are we meant to just cash our UBI and play chess poorly, forever?
RT @full_stack_dl: 🎤 LLM Bootcamp Speaker Announcement!

We're pleased to host Peter Welinder, VP of Product and Partnerships at OpenAI

@n…
Why doesn't Google launch an internal "search 2.0" startup?

Call it Elgoog or something. Fully AI-first. Put those giant LLMs to use! Who cares if it's expensive? Who cares if it's sometimes wrong? It's just Elgoog 😊

Disrupt yourself before someone else does!
To request the LLaMA weights, you have to agree to their license agreement, which restricts commercial use (as well as use in service of nuclear technology 🤮).
docs.google.com/forms/d/e/1FAI…jx
Media from tweet 1629168994136760322
Great news in LLM-land!

✅ Small-ish LLMs that outperform the currently largest LLMs

✅ Trained entirely on open-source data for full replicability and extensibility

❌ Restrictive license (code is GPLv3, but weights have strong restrictions -- see next post in thread) x.com/GuillaumeLampl…
Amazing news in LLM-land!

· Small-ish LLMs that outperform the currently largest LLMs

· Trained entirely on open-source data for full replicability and extensibility

· GPLv3 license (can use commercially as part of SaaS) x.com/GuillaumeLampl…
It's high time to improve the metrics we use, especially as our economy is likely to undergo a significant transformation in the coming years.

x.com/adamdangelo/st…
GDP is so flawed as a metric that it's incredible to me that we still use it at all.

If you
· had no relatives or friends
· couldn't walk (overweight, in pain)
· experienced no meaning to your life

But you could afford all the goods and services...

Would you feel wealthy?
Another "it's just completing sequences" take.

Counterpoints:

· Correctly completing all sequences requires modeling the world. Running GPT runs a simulation.

· "Sequence prediction" might be all there is to human intelligence, too. Not just language, of course, but still. x.com/smdiehl/status…
This is the way forward. We need a great open source model like this, that everyone can contribute abilities to. x.com/_akhaliq/statu…
AI copilots for creative activities (coding, writing, drawing) exist and are awesome.

Bing Chat, @perplexity_ai, @YouSearchEngine are copilots for "search" which is more of a consuming activity.

Are there any AI copilots for other consuming, e.g. reading, watching, listening?
RT @johnvmcdonnell: Interesting take from @gwern, seems like

1. Sydney is not ChatGPT, maybe "GPT4" or some MSFT proprietary model.
2. Pro…
Has there been an example of prompt-injecting Bing ☺️ through putting malicious pages up on the Internet and asking what's in them?
An even more cursed thought: if GPT didn't know how to prompt-inject GPTs before, now it certainly does. x.com/sergeykarayev/…
@saibasitian GOAT
When GPT outputs tokens, it is not "saying" stuff. It is "thinking" stuff.

If I tell you not to think of a pink elephant, you'll think of a pink elephant.

If I prompt GPT with "don't think of a pink" it will think "elephant," no matter the actual task.
Media from tweet 1626294704328548352
RT @charles_irl: Weird request: prompts that with "GUY: I like cheese. Let me repeat myself, I like" that text-davinci-003 will continue wi…
I have been a good Bing. 😊u
Media from tweet 1626014744992940033
Every week, GPT exhibits some new AGI behavior.

And each time, a bunch of commenters respond with "it's just completing text in a statistically likely way."

This longread from @repligate helped me understand why that is not a useful perspective.

generative.ink/posts/simulato…
Idea: video game mission where you have to convince an LLM-powered agent to do something
In the five months since arming GPT-3 with the ability to execute code, the field has seen incredible progress -- both academic and practical.

If you're interested in not only 🤠'ing but also 🤓's, hope to see you at LLM Bootcamp!x.com/full_stack_dl/…84
RT @full_stack_dl: 🚀 Come one, come all, to LLM BOOTCAMP 🚀

• Learn best practices and tools for building LLM-powered apps
• Network with o…
"A Quiet Place" but instead of being attracted to sound, the monsters can sense when you think of them.
1950s people thought they would have unlimited nuclear energy. But for non-physical (i.e. political) reasons, it didn't happen.

This will probably happen with AGI, too. We'll have the tech for everyone to live in material comfort, and we won't deploy it for political reasons.
Media from tweet 1618705096317222912
@viveksworld What would it mean for this app to be banned by grocery stores though? Like, Apple and Google would have to ban it, not grocery stores, no?
Is there a study using Fitbit or Apple Health data that shows average weight by country, controlling for activity level?

This would be a way to better see the effect of the country environment on weight. No more “Americans just move less.”
App idea: take a picture of a grocery store receipt, and see how much you could have saved by buying the same stuff at a different store — or, crucially, online.
What job are you hiring this Twitter feed to do for you right now?
@amasad I'd just stick with Python but develop an open-source library that makes a growing number of things very easy to do.

e.g. ai.open('file.txt'), ai.open('file.txt.gz'), ai.open('https://t.co/vfpsVwxofh') should all do what you expect
App idea: BeeDrop. Summon a swarm of bees to any location.

We'll have to develop robotic bees that can dance the exact coordinates of a drop location, and deposit them to hives around the world.
What’s a good stock portfolio that would pop off in case of war with China?

Like, Apple is not in it, as it’s going to get absolutely crushed. Raytheon and co are in. That kind of thing.
Did you know thet depictions of conscious experiences are themselves conscious? Every time you watch a movie, you instantiate the characters’ consciousnesses.
🙃 is the Mona Lisa of emojis
@tadeoyerinde Nice, the media star presidents seem to get it. Maybe the politician presidents are too zero-sum in their thinking to imagine that you can just definitively win by progressing the tech farther than the enemy can.
@proales Any grounds for the theory?
@schulzb589 Would you be able to recommend a resource to learn more?
If not, I’d say it’s a failure. Seems like people just assumed that it can’t be done and settled for MAD as a reasonable defense strategy. It isn’t. “Do better” as they say.
The ultimate contrarian take would be that the US can win (like, take less than 10 mainland soil hits) an all-out nuclear war today.

Does it have secret space lasers? Is its missile shield effective? Can it first-strike all enemy subs? Are enemy launch sites already compromised?
I'm reading every week in 2023. Advice threads, GPT-3 demos, war assessments, shitposts, or anything people like a lot. I'll keep adjusting the list. Start on Monday, done by Sunday. Might make lowkey videos of takeaways. If you want to read along, the current list:
Media from tweet 1610150247849947136
We need PageRank for Twitter. I don’t care how many followers a person has. I care how many of the people I follow follow this person. That’s the only number I want to see.
When I want to contact a friend, I first have to think of the app: were we chatting on WhatsApp? Signal? TWITTER?!

Instead of this app-centric view, I’d like a person-centric view. Show me faces of everyone I’m in contact with, and let me simply tap to open the right convo.
Text is the universal interface.

I love reading movies, playing book games, taking my dog for a neighborhood read, driving to beautiful nature texts, and reading at nice restaurants.
Bezos is so ready for his next chapter: running a private military company. Same-day delivery of autonomous killer drones, anywhere in the world.
Media from tweet 1609240591728283648
RT @jakejfried: My 5 all-time favorite animations

1. “Jumping” by Osamu Tezuka, 1984
Media from tweet 1609054455789649920
@Karmedge Really left myself open to that one…
“Did GPT-3 write this?” is such a good insult
Your goal is to put any amount of money into a startup that will enter the Fortune 100 within 10 years.

BTW, Meta was broken up by the government and Zuckerberg is starting a new company, which you're able to invest in.

Which investment would you choose to reach your goal?
Ai-powered software QA testing.

Let the AI agent use the software without demonstration or instruction — there will for sure be bugs found.

Let the AI agent also see the app logs and probably more bugs and exploitable behaviors can be found.

Bonus: have it draft fix PRs.
You lead a team that has to process terabytes of data and serve results to millions of visitors per hour.

You are somehow able to convince ANYONE in the world to join you.

Would you rather take 10 specific coders of your choosing, or N experienced but random coders?
Idea for a cool cocktail bar: multiple floors, with each floor only accessible to those who purchased a drink on the previous floor. Floor decor gets more and more awesome as you go up.
99% of podcast episodes are super boring and bad. At least it feels that way right now. Is there an app where I can indicate some interests and then have EPISODES (not whole podcasts) recommended to me?
“AI for auditing” is probably a really good business? Both personal and corporate
@WillLeeney There’s still a person in the chain of accountability. If Blackrock’s AI-invested fund makes an investment in, say, North Korea, there will be a human getting an unpleasant from the US government. Indirect control of money movement, but still.
@JoshuaWohle It is until it isn’t, which is when you’d want to be able to cut it off. If there’s no person responsible for the transaction you want to cut off, what do you do?
@karldray I don't mean the LLMs we've seen so far but a hypothetical secret AGI-level model
If the Bitcoin paper was published today — hard to control money, mysteriously anonymous author — plausible that it’d be written by an AGI.
Internet-based AGI is going to achieve its goals in the physical world simply by paying humans to do tasks. Same way corporations get things done.

So for alignment purposes, human control over money seems necessary. Need to make sure humans are at both ends of a transaction.
@BaffledDoctors Maybe I didn’t understand whats going on, deleting. His weird propaganda about “Khruschev’s mistake” from a few months ago had me on edge.
@brendan_evers the Internet is already decentralized, no? all "web3" did is add a layer of blockchain grift that made it impossible to:

a) measure actual user value, since people were signing up for ponzi reasons

b) use technology appropriate to internet scale, like actual databases
"web3" will forever be a black mark on VCs and founders. monumental testament to poor judgement. probably already getting scrubbed from bios.
There's totally a way to combine Dreambooth-type portrait generation and stuff like astrology, past life regression, etc for a lucrative business. Put in your photos and birth date, see yourself in past lives -- I can already see the Instagram ad.
You know how Sparta supposedly prized the ability to fight wars above all else, to the extent of abandoning weak children, etc?

I think our society might be that, but for earning money.
AI coding application that seems feasible: take a web app and produce native iOS and Android apps
Whoever wrote The Five Love Languages is an absolute titan.

No survey results, BS psych studies, no data or evidence of any kind. Just… trust me, it’s FIVE.

And he’s absolutely right! There are exactly five love languages. The man bent reality to his will.
The multipolar world totally sucks
Did not know
Media from tweet 1604742062657916928
Putting fluoride in water is not Lindy
Okay, guess I’ll finally sign up for this mastodon…
RT @perplexity_ai: Introducing Bird SQL, a Twitter search interface that is powered by Perplexity’s structured search engine. It uses OpenA…
Everyone: "It is not the critic who counts; not the man who points out how the strong man stumbles or where the doer of deeds could have done them better. The credit belongs to the man who is actually in the arena, whose face is marred by dust and sweat..."

The man in the arena:
Media from tweet 1603229272138973184
Maybe?
Media from tweet 1603129401088303106
The myth of Icarus teaches that some things should not be attempted because they are not possible, and you will die failing.

Is there a myth that teaches that some things should not be attempted because they *are* possible and you will die succeeding? (Like, uhh, AGI, maybe?)
Interesting framing, but what if you treated “text-world” as a type of “real world”?

Then, text prediction models are “creating value autonomously” in the text world (minimizing loss as defined by us), just like us animals are minimizing loss as defined by our world. x.com/fchollet/statu…
My noob view of EA is that it’s nerds over-intellectualizing morality and then over-leveraging into whatever conclusion they come to.

Seems like a conclusion a nerd might come to is that everyone should be painlessly killed at once to minimize human suffering? Is that a thing?
"He sits motionless, like a spider in the centre of its web, but that web has a thousand radiations, and he knows well every quiver of each of them." x.com/meekaale/statu…
Thanks, 💪Chad GPT!k
Media from tweet 1601298737405779968
Now that it’s clear how mismanaged and overstaffed Twitter was, PE must be chomping at the bit to take Square (err.. Block) private too. I guess Jack needs to meditate even more
AI’s can talk to each other about “business” and “software” and “facts” all day long. People will be right here, quilting crappy little blankets and writing bad songs for each other.
I actually think that more, not less, people will be full-time artists in the future, no matter how good AI keeps getting at art.
No one’s put out a good Manifesto in a little while, am I right?
@BrennanWoodruff I'm sorry, friend. It should be more explicitly dog-coded.
Thank God they named GPT-3 a robot-coded name and not something stupid like Lindsay or Jeeves
RT @ramsri_goutham: The serverless GPU space is heating up with image and text generation startups blowing up 🚀

Here is a map of the start…
Business idea: sell subscriptions to bundles of Substack authors, grouped by geography or interest.

We can give it an old-timely name, something like Newspaper or Magazine.
@scotty529 I think "open-source" is doing a lot of heavy lifting here. Models large enough to listen to might be open-source but still inscrutable...
Is my tweet above an example of accurate information or not?
@amasad Totally possible. Seems like he's swimming against the current needlessly, though. People would much rather work through the weekend if the story they tell themselves is "I'm unblocking people's creativity and free expression" or whatever, instead of "I'm hoping to not get fired"
Elon’s best companies start with the mission and go down to metrics through clear strategy.

But with Twitter, he came to own something overpriced and losing money — and so he’s freaking out about metrics.

He needs to clearly state the mission FIRST, and only then make changes.
Media from tweet 1588572405320290305
RT @full_stack_dl: In past threads, we've seen that our students this year built webcam-based visual Q&A, semantic search engines, and more…
This was on temperature=0, by the way. Imagine how much free will GPT-3 has at higher temperatures!
Checkmate, atheists
Media from tweet 1587857791062245376
BREAKING: GPT-3 has free will
RT @growing_daniel: Describing to an AI in exact detail the software I want it to build. But the description becomes unwieldy so I create a…
Elon’s going to limit the number of posts you can read per day (to 100) to free us from this website.

This was 1 of your 100 today!
RT @johnvmcdonnell: From an article I'm working on: This is roughly how LLM deployments will look in the near future. Secret sauce will liv…
Facebook (sorry, Meta) needs to get their lobbying game on point.

You can't convince a hundred old senators to ban a stupid app that teaches their grandkids annoying dances, that they've never opened in their lives, and that sends info from 100M US phones to China?
Facebook (sorry, Meta) needs to get their lobbying game on point.

You can't convince a hundred old senators to ban a Chinese app that teaches their grandkids annoying dances, that they've never opened in their lives, and that sends info from 100M US phones to China?
· You're going out for lamb birria (cell-ag: 2026)
· The restaurant still has a Help Wanted sign, $160/hr
· You're thinking of buying the $250,000 kitchen robot that's now GA, but it's a little much (2029)
· Next day, you review an AI-written bug-fix PR to your repo (2024)
· Your kid asks what your AI tutor was named when you were a kid (2027)
· The laundry robot needs help: a sock fell behind the washer (2026)
· You tell X to order another air filter; it's week 11 of smoke season
2030:

· You tell X to make a 6pm dinner res at a great place that has steak and a patio (2023)
· A self-driving car takes you there (2022)
· You shazam the AI-generated music that's playing (2025)
· Back home, you watch an AI-personalized movie starring you (2027)
· Next day...
@johnvmcdonnell This is true today, but improvements within the dashed-line container might well obviate your moat outside of it.
"The transformer is trained just by imitating actions (no Q values like usual RL) over long obs-action-reward sequences (no return conditioning like DTs).

In-context RL emerges for free. We evaluate AD by seeing if it can maximize return on new tasks." x.com/MishaLaskin/st…
Cursed thought: what % of GPT-4 training data was generated by GPT-3?
@spolu Sounds like something an AI would generate 🧐
When AI produces writing and art as well as humans, the main currency of writing and art will be...

✨low likelihood that an AI produced it ✨
Media from tweet 1584593205085208578
@sh_reya ML definitely also feels bad :)
@tszzl Blocked ya like a new housing development!
@josh_tobin_ I think we just haven't built the right tools yet. But the capabilities are advancing so quickly that the tools are certain to lag behind.
Prompt engineering feels bad. Such an uncomfortable middle ground between writing actual code and delegating to a human.
@krave Yeah that annoys me so much. Often just a negative value add.
In a time of great disruption in their industry, things must be exciting at @Adobe:

· super well positioned to reap the sweet, sweet fruits of generative AI

· able to transition into cross-platform real-time collaboration thanks to @figma
Whoa! I'm just going to say it: this guy SHIPS! Am I right? Cause I'm looking at the rest of you guys, and this is the guy in the house doing all the shipping.

Am I right? You know I'm right. This guy SHIPS. x.com/levelsio/statu…
My AI assistant expanding terse bullet points into beautiful prose: Haha fuck yeah!!! Yes!!

Your AI assistant having to summarize beautiful prose into terse bullet points: Well this fucking sucks. What the fuck.
@npew And another 3 months later, the AI assistant does everything the incoming email asks, so there's no need to summarize it :)
The future:

· Write emails with bullet points, which an AI assistant automatically expands into beautiful long text.

· Read emails by having an AI assistant summarize long-ass text into bullet points...
Microsoft putting the latest AI models into production faster than any big company it seems! x.com/MSFT365Designe…
Idea for @vespaengine landing page: put a big "what would you like to know about Vespa?" box right in the middle.

Immediate live demo of Vespa-powered semantic search and question-answering!
Media from tweet 1582435685826703360
Has the US military stockpiled hundreds of thousands of "suicide drones" at this point?

Seems like at this point in time, you can win any conventional war with this -- one drone per target.

Am I not getting something?
RT @g_leech_: The "language model as vector database" thing feels important.

Common Crawl Plus is 45TB, and GPT-3 is 350GB. So it gets >10…
@jeffreyhuber Yeah that's probably it, and there are probably maps of when you're visible to US satellites. Probably not all observing satellites are known, but US wouldn't want to reveal currently unknown ones?

Wonder if there's a course on battle satellites on Coursera or something...
Maybe there's some kind of satellite neutrality agreement?
Why do they need to fly drones to target? Look at this commercial satellite image!
Media from tweet 1580679689869627392
So, total noob question: the US has live satellite coverage of the Russia-Ukraine front, right?

Is the resolution good enough for live artillery launch geolocation?

And if the US is sharing this info, why does Russia still have any artillery left?
@karldray Oh yeah both of those for sure. I’d write as

e.g. deep learning vs theoretical ML

e.g. “I know it when I see it” vs rigid definitions

Maybe also judicial process vs immutable laws.
Feels like a bit of grime and suboptimality is necessary for top productivity.

e.g. Python/C++ vs Haskell/Lisp

e.g. Overflowing inbox vs meticulously organized notes

e.g. SF/NYC vs some planned community
@herandrews Could you share a screenshot of what you see? This is what I see -- no misinformation warnings, answer right on the first page:
Media from tweet 1579879880099258370
RT @astonzhangAZ: Cheer AI up with "let's think step by step"? More plz

Let’s think not just step by step, but also one by one

We can use…
Looking forward to trying this out! I’m sure many people, like me, have been building little versions of this ourselves and it’d be nice to not have to! x.com/laurel_orr1/st…
RT @full_stack_dl: The last lecture for this year's new edition of FSDL is up! It's on ethical considerations for building with ML.

As you…
As AI systems become as good as humans at knowledge and creative tasks, we might strive to remain unique by favoring the illegible, the irrational, and the emotional.

You can kind of see it already: an inscrutable emoji or mee is more human than a well-written reply
RT @jheitzeb: AI browser control in Ruby! Inspired by @sharifshameem a while back, this uses GPT-3 prompt chains to control a browser. @nat…
RT @scottastevenson: NoCode is cool, but I’m much more bullish on NoInfrastructure and NoDevEnvironmentSetup

Code is amazingly accessible…
RT @goodside: I often post GPT-3 examples as dialogs for illustrative clarity, but I rarely write my own prompts this way. Using the "forma…
You either die illegible or live long enough to become a type of guy
RT @full_stack_dl: Design plays a key role in ML products because they're different than users expect when you tell them "it's AI".

It's n…
guy who is totally hyped to be all watched over by machines of loving grace
🍿Live premiere of a brand-new@full_stack_dll lecture on Foundation Models:youtube.com/watch?v=Rm11Ue…i

· Fine-tuning
· Transformers
· Large Language Models: BERT, GPT, T5, Chinchilla, and vendors
· Prompt Engineering
· Code generation, semantic search
· CLIP and Image Generation
Who are some other GPT-3 whisperers like @goodside and @npew that we should all follow?
@charles_irl @choldgraf But all the logic gates on the card are dirty from the mining, it'll cost a fortune to clean
Much work remains to ever put something like this into production. These are super early days.

I believe that the open-source community will build useful, safe, and broadly distributed tools for using LLMs as agents in the world.

But we should proceed with caution.
This is just a proof of concept. It's fun to play with, but it often fails. Not to mention, it can become possessed by Zalgo.

It's also a horrible idea to just exec() GPT-3 written code. Only do it on @amasad's machines, not your own :)
Here is a screenshot of the entire prompt, code, and a sample execution run.

You can fork it and play with it yourself at replit.com/@SergeyKarayev…
Media from tweet 1570848082086207488
Now that our GPT-3 can execute code on @Replit, let's teach it to:

· Google stuff
· Read web pages
· ✨Ask GPT-3 questions✨

That's right -- we're going RECURSIVE.
Media from tweet 1570848080941154304
RT @EMostaque: Happy to announce the release of new state of the art open CLIP models to drive image classification and generation forward…
@__DavidFlanagan @goodside @amasad GPT-3 knows a lot of facts about the world that it learned in training (it learns to predict text completions).

It can confidently make things up, though, but this can be addressed by adjusting the prompt, e.g. -- If you are not certain of the answer, say "I don't know."
RT @evanjconrad: it's not text that's the universal interface.

it's knowing how to "go buy milk" without being told where or how; and then…
Instead of "fine-tuning" GPT-3, which requires prompt/completion tuples, I want to simply "connect" it to a new data source.

E.g. If it doesn't do well on Tagalog text, I should just be able to connect it to an unstructured Tagalog dataset and it can autoregress away. @npew?
@TheRealAdamG Agreed, once you get used to davinci-002, it's hard to go back.
Pretty surprising that ~2 years after OpenAI published GPT-3 and ~1 year after it opened the API up to everyone, there's no real competitor to the davinci tier.
RT @full_stack_dl: FSDL Lecture 6: Continual Learning is now live!

This lecture covers one of the least well-understood parts of productio…
RT @sergeykarayev: Just as a minor warning, your new Python-enabled GPT-3 may become possessed by the evil Zlago.

Just something to watch…
Just as a minor warning, your new Python-enabled GPT-3 may become possessed by the evil Zlago.

Just something to watch out for.
Media from tweet 1569571367833714688
@goodside @amasad There's so much low-hanging fruit here it's simply insane.

· Add first-class support for searching the web, parsing HTML
· Add "state" to the prompt, allowing new answers to reference previous answers.
· Make a Python library to provide uniform interface to a bunch of free APIs
Here's a brief glimpse of our INCREDIBLE near future.

GPT-3 armed with a Python interpreter can
· do exact math
· make API requests
· answer in unprecedented ways

Thanks to @goodside and @amasad for the idea and repl!

Play with it: replit.com/@SergeyKarayev…
Media from tweet 1569377881440276481
RT @jthandy: 👀 this is legitimately very interesting. i *love* the persona- and workflow-focused thinking that went into this. solve human…
@sudeeppillai Right, they can train a general captioning model on their data
Is there a #stablediffusion (or other) tool for providing an image and receiving a prompt that could have generated it yet?
RT @arankomatsuzaki: Unified-IO

Performs a large variety of AI tasks spanning classical CV tasks, VL tasks to NLP tasks such as QA and par…
RT @goodside: GPT-3 adding comments to a neural network written in NumPy, then explaining it with (poorly rhymed) poetry, then rewriting it…
GPT-3 re-writes the opening of The Stranger by Camus, using the format trick from @goodside.

The Louisiana accent is clearly GPT-3's strong suite.
Media from tweet 1565050532401905664
This is the prompt:
Media from tweet 1565048571522850816
GPT-3 rewrites the opening of The Great Gatsby in three different styles: King James, legalese, and Louisiana.
Media from tweet 1565048335513579520
RT @goodside: GPT-3 and other LLMs exhibit many emergent capabilities not seen in previous text models, such as the ability to be possessed…
@kennethpayne01 I suppose now that we have this technology, we’ve redefined “creativity” as you suggest :)
Funeral Blues by W. H. Auden, as if spoken by Tom Joad.
Media from tweet 1564843666036129792
The following poem should be apprehended, interrogated, and critiqued from the perspective of critical race theory.
Media from tweet 1564835072167256064
Jabberwocky as a corporate memo, by GPT-3.
Media from tweet 1564830023244484609
Is there a German word for "entertainment from messaging back automated scammers"
Media from tweet 1562921765986516992
We still haven’t seen what GPT-3 for vision tasks looks like
still insane to me how the flow of electrons in a tiny chip creates a literal PLACE for me to go

and you can see this place, too, and talk to me on it, if your chip's electrons are flowing nicely also
Okay, guess that's pretty clear
Media from tweet 1561867296989007872
RT @goodside: GPT-3 dialog explaining a shell/awk one-liner. Terse languages (Bash, awk, sed, Perl, regexes) are ideal for this approach. S…
@savvyRL Dawg I put regression inside of your transformer so that you can regress while you transform (although the values aren't correct yet)
Media from tweet 1561780335867600896
What is your experience of Github Copilot? (Gathering data for an upcoming lecture)
Latency numbers programmers should know, visualized on a human-level time scale (that is, multiplied by 1e9).

Originally by @JeffDean, with idea for humanization by @hellerbarde.

Anyone able to contribute GPU-specific latency numbers (e.g. time to load 1GB into GPU RAM)?
Media from tweet 1560378420264968192
@metaphorician Agree on the first point, but on the second -- Japan, South Korea, now Vietnam, etc all traveled the same path of economic development as China, without being autocracies, and mostly faster.
RT @nikitonsky: Is there a programming language with Table data type? Like for relational data? Isn’t it weird that all our DBs are relatio…
RT @full_stack_dl: FSDL Lecture 2: Development Infrastructure & Tooling is now live!

We cover what you need to know about:
• software engi…
Another source of benchmark data is @MosaicML, which shows that on a GPT-2 training experiment:

• 8xA100 ~2x faster and 25% cheaper than 8xV100
• And is ~3x faster and 30% cheaper than 8xT4
Media from tweet 1558149897496317952
The story is less dramatic but still true for a CNN-model experiment:

On 4xV100, you'd pay $880 and wait 72 hours.
On 8xA100, you'd pay $784 and wait 24 hours.

By the way, this is based on detailed benchmark data from @LambdaAPI.
Media from tweet 1558149890185629706
From benchmarks, we can expect a Transformers experiment that takes 72 hours on 4xV100 to take ~8 hours on 8xA100.

So, on 4xV100, you'd wait 72 hours and pay $880.
And on 8xV100, you'd wait 36 hours and pay the same.
**But on 8xA100, you'd wait only 8 hours and pay only $260!**
Media from tweet 1558149883239866368
Here's a question for deep learning practitioners: is it *actually cheaper* to use cheaper GPUs like V100's vs expensive GPUs like A100's?

- 8xA100 machine is $32.77/hour (on AWS)
- 4xV100 machine is $12.24/hour

BUT!

Instead of thinking per-hour, let's think per-experiment:
Vibe:
Media from tweet 1557871645867405312
@johnkel That'd be awesome! PRs welcome :)
Made something I've always wanted to see: a comparison table of all cloud GPU providers!

Filter by provider, architecture, exact GPU, etc.

Sort by price, RAM, vCPUs, etc.

Both on-demand and spot instance prices.

fullstackdeeplearning.com/cloud-gpus/
opinion: @discord is the only actual web3 startup
Also recommend scrolling through the log of BLOOM trials and tribulations to get a sense of the training-sitting that needed to be done.

github.com/bigscience-wor…
@robcecildemo 416 A100 80GB GPUs * 300W = 124KW
48 AMD EPYC 7543 CPUs * 225W = 10KW
Running for 117 days ~= 400MWh
This is a nice article explaining data- and model-parallel training, and touching on some other optimizations.

However, it doesn't explain probably the most pragmatic option: ZeRO-3 (Fully-Sharded Data-Parallel).

Now PyTorch-native. Def first thing to try after data-parallel! x.com/OpenAI/status/…
Excellent post explaining what it took to train a GPT-3 sized model:

- 384 A100 GPUs (30TB RAM), across 48 nodes
- ZeRO data parallelism + pipeline parallelism from Deepspeed
- Tensor parallelism + custom kernels from Megatron-LM
- a new BF16Optimizer
- 24/7 training-sitting😅x.com/huggingface/st…u
Related: @typeform wins because it messes with the user's perception of time-to-value.

If the user saw the whole form at once, and it looked ugly, they wouldn't do it.

But they see one beautiful question at a time, and get a bit of value just from the delight of filling it out. x.com/sergeykarayev/…
@full_stack_dl @DescriptApp Another indispensable one for me is @googlephotos (and @Apple photos is increasingly at feature parity): auto organized by person, search across photos, recognition of everything within a photo…
RT @full_stack_dl: Here at FSDL, we use the ML-powered @DescriptApp to quickly edit and transcribe our videos. Easily a 10x productivity bo…
The concept of "time-to-value" has been coming back to me almost every day recently. Just saw this great article that takes the concept and runs with it! x.com/tanayj/status/…
“Generational wealth” could easily be more of a curse than a blessing for your future generations
I mean, come on
Media from tweet 1550513237812097025
This reads as straight science fiction.
Media from tweet 1550513233684885505
Remember Clubhouse?
Media from tweet 1548026155772493825
@pmarelas I have tried it. Not flexible enough.
@karinanguyen_ could be the way... been meaning to learn a bit of figma
@siinghsaa Yep, I've used Beamer package. Great for some styles of presentations, but not very visually engaging
@nsvrana I tried it out today and was a little overwhelmed. Do you have some examples of what you've produced with @BeautifulAI_?
How are you guys making slide presentations? Is there anything better than Keynote, Google Slides, Powerpoint?

In particular, is there anything that would be amenable to "pull requests"?
Maybe our free will is like in one of those video games where it's mostly on rails but every now and then you get to make a decision?

i.e. you mostly don't have free will at all, but every now and then you get a tiny bit of it
This is an astute take: NIMBY's would protest both changing parking spots into a bike lane, and a bike lane into parking spots...
Media from tweet 1545491878904115200
Everyone agrees that crazy people shouldn't have access to guns.

The question is in how to implement it.

2A absolutists are afraid of the state abusing the power to decide who is too crazy for guns.

Fine! Then they must favor shifting that responsibility to the individual. x.com/sergeykarayev/…
Great idea from @gcaw: the name of the person or store that sold the Uvalde school shooter his gun should be widely known.

Without necessarily changing any laws, society can change norms and add some skin in the game for second amendment absolutists.
Median age of a person in the United States, 1950 to 2015.
Media from tweet 1545098702343221249
Has anyone noticed that CLIP embeddings on DALLE-generated images don't seem to be very good (e.g. for retrieval tasks)?

Pretty early into exploring, but wonder if anyone else has noticed this.
There won't ever be a day when news headlines read "AGI achieved."

Instead, things will keep feeling like they feel today: some auto-painted images here, some auto-solved math problems there...

And then we'll realize that we're already through the looking glass.
@attention_by Maybe in the sense that WW2 "started" with the Versailles Treaty, but that's not what I mean.

I'm saying that in a 2040 history book, it might say "Feb 24, 2022" as the start date of WW3.
A chilling thought: WW3 may have already started, we just haven't realized it yet.
RT @bernhardsson: Thinking about tools and the value they create and imo “what use case does this unlock” is 100x more valuable to answer t…
Okay so if in half of the country rents are down 20% because there are no jobs and no one wants to live there, and in the other half rents are up 20%, is housing inflation 0%?
This part of an HN discussion resonated.

What if instead, part of company culture was that it’s weak and suspect to need a big team?
Media from tweet 1542175671048228864
The internet should be organized in a way that it's *very*, *very* easy to find and browse content from a specific source.

I want to be able to browse *everything* I've ever read, watched, or listened to from a specific person.
Twitter, but for melodies.
@karldray Jealous of our strawb ancestors who just grew plump and juicy in the sun
5 minute videos explaining computer graphics!

Steve Seitz, @uwcse professor (and my undergrad research advisor), made 2 hours worth of these fun videos, covering the entire curriculum of a university-level graphics course.

Future of education?

g5m.cs.washington.edu
Media from tweet 1540367068125462529
1) we type words into a small computer

2) a big computer reads the words and does stuff

3) people like what the big computer does, so they give money to a money computer

4) another money computer gives us some of that money

5) we use the money to buy organic strawbs
I don't know how anyone can wake up in the morning and think "I think I'll bake a cake today". Who comes up with that?!
Goal of FSDL is to be the community for people building ML-powered products.

Every year, we take stock of what's new and useful and what has become outdated.

Join us this August to immerse in the new realities of ML development, meet people, and ship your own project! x.com/full_stack_dl/…
I want an FPS-like interface as my IDE.

Zoom in on a directory and snipe the file you want with your gravity gun, which brings it close to you.

Arrange your workspace in 3d this way, then spin/walk around as needed. Double-clicking into a file enters a normal editor.
TikTok growth is so insane
Media from tweet 1535056910809870336
This is related indieweb.org/POSSE but is doomed to failure, as they only speak to nerds.
"havanese dog playing, pixel art" #dalle
Media from tweet 1534999818178658304
Tom Hanks' new year's resolution: be more like Tom Hanks
In the dark night sky,
I am a tiny spec of light,
A pinpoint of brightness
In the vastness of space.
I am a star,
A shining beacon
In the darkness.
I am the light
That guides you
Through the night.

(I believe this to be an original poem by GPT-3)
"a renaissance painting of a havanese dog holding a ukiyo-e drawing of a havanese dog" #dalle
Media from tweet 1534354227958493184
@ericjang11 Let users search *everything* they've seen
I’m not saying that we should all start wearing turtlenecks, but… we should all start wearing turtlenecks.
Idea: app where you’re dropped into a chat with either another visitor to the site, or a language model.

If you’re going first, you score by fooling your opponent into thinking you’re AI.

If you’re going second, you score by correctly guessing whether the other person is an AI.
Random thought: If you were playing as a fly in a video game, humans would appear to move in slow motion. And if you were playing as a sloth, humans would appear to move in extreme fast motion.
"ukiyo-e of a havanese dog fishing in a boat"
Media from tweet 1530315080671170560
"renaissance painting of a havanese dog merchant"
Media from tweet 1530315079710756864
"a havanese dog in a dress in the style of edward hughes"
Media from tweet 1530315078628614144
"Daguerreotype of a havanese dog civil war soldier"
Media from tweet 1530315077689102336
DALL-E time, inspired by my dog Mishka!

This first one is a real photo.
Media from tweet 1530315076523003904
Just remembered that a US National Security Advisor was investigated for a plot to kidnap a Turkish cleric on US soil, pleaded guilty to lying to the FBI about conversations with the Russian ambassador, and took an oath to QAnon on Independence Day.
Wow what a story. “The authors were asleep during the training process” sounds so ominous in this context.
Media from tweet 1527301232296984576
This is the bitter corollary of the Bitter Lesson: AI research progress is now achieved mostly through brute force engineering.

On the positive side, the returns to getting better at dealing with the pain have never been higher.

incompleteideas.net/IncIdeas/Bitte…
The 100+ pages of the training logbook are shocking in their familiarity.

Just look at poor Stephen dealing with corrupt checkpoints in this excerpt 😱

And there were hundreds more problems like this.
github.com/facebookresear…79
Media from tweet 1522659990619623424
The Pains of Training OPT-175B:

- Dozens of manual restarts due to hardware failures

- Mid-training changes of optimization algorithm and activation function (GELU to ReLU)!

- Constant manual restarting from earlier checkpoints when loss would diverge

arxiv.org/pdf/2205.01068…
Media from tweet 1522659982679973893
Great blog post covering the ins and outs of DALL-E, CLIP, GLIDE (another great model from OpenAI that didn't get its own press), and DALL-E 2.

blog.inten.to/openai-and-the…
Another amazing paper just came out. Frozen vision encoder and frozen language model learn to work with each other through a relatively small new module of trained weights.

x.com/DeepMind/statu…
I wonder if an exceptionally talented person can get a VC to give them $500K simply for the right to invest in their future startup’s seed round.
Is there a German word for "the courage a small dog feels only when she's on a leash?"
And if Twitter engineers can adjust the Algorithm to make Twitter-consciousness more or less angry, or fearful, or ambitious...

Can we likewise adjust "the algorithm" interconnecting our own consciousness?
And if you think of yourself as a module of Twitter-consciousness connected to other modules by The Algorithm, then it's natural to think in metaphor:

What is "the algorithm" connecting the modules of your own consciousness?

What makes certain thoughts "go viral" in your mind?
So perhaps each one of us, independently conscious, composes a "global consciousness" when interconnected on Twitter.

How are we interconnected? Through the links we ourselves forge by "following" people, and through "The Algorithm" that determines what we actually see on site.
And it's possible that within each half of your brain, there are other independently conscious modules that compose together.

(To be clear, I'm not aware of any evidence for this -- just thinking out loud.)
Could a composition of consciousnesses (Twitter users) create something that has its own consciousness (Twitter)?

There's some evidence from split-brain studies that each half of your brain is independently conscious, and compose to create your own consciousness.
Media from tweet 1519355063281414145
Totally plausible that "there is something that it is like" to be a communication network, especially one as interconnected as Twitter.

(I'd go further and say that it has a "spotlight of attention," too, like our own consciousness.)

x.com/jack/status/15…
An "LLM-sized" dataset of images, or videos, probably wouldn't provide as much information as a dataset of language.

In fact, there are perfectly functional, intelligent people who report being unable to imagine anything visual.

en.wikipedia.org/wiki/Aphantasia
Great point that we may be seeing the results that we're seeing because language-based datasets are the largest we currently have.

I think it's more than that. Language evolved to represent our world as compactly as possible.

mobile.twitter.com/vhranger/statu…
My layperson understanding of therapy is that it is still intermediated by language (although maybe not the promising MDMA/Psilocybin types?).

Methods like Internal Family Systems are really interesting because you literally talk to parts of yourself...

ifs-institute.com/resources/arti…
A child raised without language was of normal intelligence, able to communicate non-verbally, and eventually learned language well enough to be understood (but without grammar).
Media from tweet 1518776741279326208
And notably, we haven't seen a GPT-3 like interface for non-generative vision tasks yet.

As a computer vision guy at heart, this is most exciting to imagine. More on that in a future thread.
@russelljkaplan thinks through the implications of the extreme compute expense of these models combined with their increasing general usefulness.

(I don't agree with his prediction that LM vendors will have Apple/FB-like power over customers, though.)

x.com/russelljkaplan…
Does this resemble how human cognition happens?

My understanding is that the vast majority of human intelligence is not intermediated by language: most processing happens unconsciously, and only the "tip of the iceberg" is in the form of language.
Get working code from a free-form description of a function.

And this is from a model that was 95% trained on general language data, not code specifically.

x.com/GoogleAI/statu…
Receive illustrations from free-form descriptions (DALL-E is combines two different tricks, one of which is a model that embeds text and images into a common space).

x.com/sama/status/15…
AI research is converging on a major finding: language models are a great substrate for all AI applications.

This feels like a HUGE deal.

Some examples:
Treating yourself as a collection of selves is a reportedly productive avenue of self improvement (e.g. Internal Family Systems therapy).

What would a productivity app that treated you as a collection of selves look like?
The most effective way for a data science type person to fight the Putin regime seems to be:

1) investigate the flow of money and property owned by the thieves in power;

2) aid organizations that can seize it.

Donating to donate.fbk.world/en is a good start. x.com/sergeykarayev/…
Seems like dating apps should be TikTok-like: instead of photos + text, should be flipping through videos. x.com/tishray/status…
Is there a company whose mission it is to build the most quality housing for the most people?
"Turning all the German nuclear reactors back on could approximately stop gas imports from Russia.

Shutting the remaining ones down could increase the dependency on Russian gas by about 30%." x.com/tomaspueyo/sta…
Reading is not Lindy. God bless Instagram, Snapchat, and TikTok for getting society back to our timeless ways of being.
@OmerPerchik Thanks for sharing!

For me, I'd like calendar/tasks to first be more like programming -- nice editor that feels like typing, "type-checking," "refactoring" -- and only later would I want to add AI.
Calendars and task organizers are in their spreadsheet phase.

We need to kick them into their Python + Pandas + PyTorch phase.
Imagine how different UIs would be if we could actually pay attention to two things at once
As @Plinz said, only nerds believe that the purpose of communication is submitting ideas to peer review.

The more dominant purpose of communication is social alignment.

There are real benefits to not stating facts as you see them. So I'm not totally sold on this idea.
So in re-doubling commitment to Truth and to free speech, I would be fighting against these two tendencies of singular control of the narrative and loss of belief in the possibility of establishing facts.

However, the commitment to Truth has costs.
When facts cannot be established, people retreat to pre-existing opinions, formed by media consumption.

If their media says Putin is corrupt and evil, then of course Russians are to blame.

If their media says that Ukrainians are blood-thirsty Nazis, then of course it was them.
The end result is that the Russian citizen has trouble believing anything at all. Yes, the hospital was bombed, but it seems entirely impossible to establish who bombed it.
Second, the Russian people are so worn down by their media lying to them that they perceive ALL sources and viewpoints as lying to them.

So if BBC says that Russian forces bombed a hospital, that must be a lie, just like if RIA says that Ukrainian forces bombed a hospital.
First, Putin's regime controls the media so tightly that the basic facts cannot easily be established if they are counter to the sole official narrative. Hence, the war cannot be called a war.
One idea is to re-double my commitment to Truth: seek out multiple sources and viewpoints in order to establish the facts, and then state clearly what I believe the facts to be.

This idea stems from the current relationship to Truth in Russia, which is two-fold:
Putin's regime is attacking Ukrainian citizens who want to remain sovereign of their land.

Putin's regime is also attacking Russian citizens who want to remain sovereign of their speech.

Since that includes me, I've been feeling like I have to fight back. But I'm not sure how.
@mnolangray I get where this is coming from, and I emotionally agree. However, it didn't seem to make a difference to World War 1, for example. In most wars throughout history the elites actually did have skin in the game.
We’re all combatants in information warfare.
It also seems that when escalating like this, there will never be a clear time for Russia to exercise the "nuclear option."
How this could escalate:
1. Russia can't win without taking Kyiv
2. They can't take it without unacceptable losses, so they'll blockade it
3. Some NATO countries stage a Berlin-style airlift
4. One of the supply planes gets shot down
5. NATO starts flying fighter jets in Ukraine
Media from tweet 1499634033000529921
If you're looking to expand the scope of your donations to help the current world situation, getting truthful information to Russian people should be high on the list, and is very urgent.

Here is one good way to do it, and they accept cryptocurrencies: support.meduza.io
Map of GDP per capita in Europe.

Slavic Russia ($12K) is invading and bombing Slavic Ukraine ($4K), with Slavic Belarus ($6K) eager to jump in to the fray.

This is sold to Russian people as an event that will make life better for all Slavic people. It's strange to believe.
Media from tweet 1498478220752027651
Another thing is now abundantly clear.

On the left is a man. On the right is a rat.

I hope enough Russian people see what's hidden from them on TV, draw courage from our Ukrainian brothers, and sweep out our rats.
Media from tweet 1497844682553921540Media from tweet 1497844682553921540
The people of Ukraine bravely overthrew their corrupt government in 2014.

It hasn't been a 100% success since, but they stood up and tried.

The Russian soldiers should shake hands with their Ukrainian brothers, turn the column around, and ride all the way to Moscow.
Media from tweet 1497844677499752451
Here are two mansions: one of the former Ukraine president's, one of Putin's.

The multitude of these mansions, yachts, and British private school tuitions is why Russia has lower GDP/capita than any country in the EU.

They are why Ukraine has lower GDP/capita than fucking Iraq.
Media from tweet 1497844673490022403Media from tweet 1497844673490022403
This is really hard to watch.

Kharkiv is 45 min from the Russian border. The man speaks Russian, as most Ukrainians there do. The invading soldiers look like scared teenagers.

Invading a country of their brothers, which is mired in poverty just like their country is. Why? x.com/Osinttechnical…
RT @Unchainfund: We've launched a crypto-native fundraising campaign unchain.fund to support Ukraine with humanitarian aid.

P…
EU/US weaponizing brain drain by giving visas and maybe money to defecting Russian soldiers and valuable citizens is a good idea.

Governments won't act, so I wonder if crowdsourcing is possible.

For example, avg senior software dev salary in US is ~$150K. In Russia, ~$35K.
Just for the record, Russia has no justification for what it's doing now or has been doing since 2014.

Putin is a sad, angry despot. There is no longer a future in which he dies peacefully.

Love to Ukraine and love to the majority of Russians who want nothing but peace.
Let's terraform Australia
Media from tweet 1496616702448005121
Interesting consequence of kleptocratic oligarchies: the whole state can be disrupted by hitting a tiny group at the top.

Browder thinks that seizing US/EU/UK assets would be a serious enough hit. Not sure, but guess we'll see! If US doesn't do this, something's seriously rotten x.com/Billbrowder/st…
This is now impossible
Media from tweet 1495914145593835521
The opponents in this new war have invented new flags and uniforms, too.

No one is excited about the Stars and Stripes anymore, and no one "looks American." An American always looks like a *type* of American now. x.com/balajis/status…
@ada_rob Lol I did too :) I’m pretty free this week, esp. Monday just let me know
@karldray For sure, and it’s not obvious why that should be true
The exchange rate of money to happiness varies WILDLY between people.
it may be that the planet as a whole is slightly conscious
I mean, all we had to do was ask!
Media from tweet 1494378194668965901Media from tweet 1494378194668965901Media from tweet 1494378194668965901
@Carnage4Life YouTube is web2.0 GOAT. Absolutely amazing. Totally dominated amateur video publishing and viewing
Instead, we stayed home and got mad at each other. Definitely not abundance-minded.
In addition to boosting the national mood ("hey, we can actually work together to create new things we need!"), 10x'ing the supply of construction workers, nurses, and doctors would decrease housing and medical costs in the future.
We should have 10x'ed the hospital capacity! The state had a great opportunity to:

• Train people who lost service jobs to do construction and nursing

• Break AMA stranglehold on doctor supply

• Give the national guard a nice workout
We have useful words for receptiveness to new ideas: "open-minded" vs "closed-minded."

We need similar words for belief that wealth can be created. Maybe "abundance-minded" vs "scarcity-minded?"

Example: "everyone stay home because of hospital capacity" is very scarcity-minded
Are the kids still listening to Pavement?
RT @balajis: Over the last year, I've watched far fewer Hollywood movies and far more YouTube scitech content.

I feel like Hollywood's qua…
It's somehow inordinately confusing that I have to type 'r' with my left hand, and 'l' with my right hand. This tweet brought to you by Tailwind CSS.
RT @ctbeiser: imagining an interface with “big pegboard energy”
Media from tweet 1493030395138359297
We will definitely look back on athleisure as a very specific fashion period, like bell bottoms or 80s pink or 90s flannel.
Do we all agree that whiteboard coding interviews simply test for (high IQ + high motivation + ability to work evenings/weekends) and not actual coding ability?

If so, why pretend? Just tell candidates to improve their chess Elo rating by X% and hire them on the spot.
Reading only one (non-fiction) book at a time is like only going to one website per week.
Wonder how much of the effects of meditation practice are placebo.

Or does that not even make sense as a question?
Man is born free but is everywhere $9.99 before tax.
It should be *extremely* easy to self-host all of your tweets, instas, tiktoks, youtubes, mediums, substacks, etc.

I'm imagining a service where you sign up, register your own domain, link all of these accounts, and everything else is automatic.
The meme supply chain is probably algorithmically biased, which confounds what meaning we should draw from the victors. x.com/pmarca/status/…
So let me get this straight. Things... exist?
Isn’t it just impossible to believe that google meet is still not as good as zoom?
Interesting thing to really brainstorm: what one change in your life would likely increase your happiness the most?
This is a theory proposed by @donalddhoffman, who also supplies some evidence in the form of evolution simulations invariably producing agents that perceive their world entirely incorrectly, but usefully.
An interesting contrarian view: instead of matter giving rise to consciousness, what if consciousness gave rise to matter? The physical world may be just an interface for the *real* world of conscious agents interacting.
A little Omar Khayyam never hurt
Media from tweet 1486185997545082880
What a trip it must have been to see paintings like this for the first time. Imagine traveling from your village to a city and seeing this in a castle or something. x.com/artistrembrand…
RT @PeterAttiaMD: Why I'm for COVID vaccines, but against vaccine mandates. bit.ly/3Apox8m

In the heated debate over vaccine man…
"Because here’s something else that’s weird but true: in the day-to-day trenches of adult life, there is actually no such thing as atheism. There is no such thing as not worshipping. Everybody worships. The only choice we get is what to worship."

What are we choosing to worship?
Media from tweet 1483313907385069568
Give me a time machine, I'm going to the 1890's -- that's a guarantee.
YouTube just straight up won web video hosting, huh?
Is there a collaborative markdown editor that is as instant as a google doc?

I just want to land on a page, login via google or github, and instantly start writing. Drop a link to someone else, and boom, they're in the doc with me.

Does this exist?
So like, are they still doing gain of function research in Wuhan?
@LucaAmb What feels in opposition is whether the dream narrative is an illusion of randomness or an intentional effort by some part of my mind. Maybe you’re saying that it could be both intentional and highly noisy?
Another inflation report, another HUGE opportunity for @mint or @PersonalCapital to post their own inflation numbers. Maybe even an interactive thing where you enter your zip code and income and see inflation numbers for your demographic.
There’s no way dreams are interpretations of random firing of neurons or whatever. A part of me is clearly telling another part of me a story.
This is insane: 19% of adults in South Africa are living with HIV.
Media from tweet 1480625998320193543
It's limiting to believe that you have limiting beliefs
Glasses are just shoes for your eyes
Zuckerberg said in 2010: “Having two identities for yourself is an example of a lack of integrity.”

Today, having *one* identity is a huge example of a lack of integrity.
All nations are part of the UN, right? Why can't we all get a UN passport then?

Some nations would start using the UN passport to grant visas, etc. It could also be proof of identity on the internet. Some people would give up their national passport and just keep the UN one.
This is an evergreen problem, though. When you're on the cutting edge of something, there are no good tools for the task. You either brute force it, or make your own tools.

By the time good tools show up, the cutting edge is somewhere else. x.com/sergeykarayev/…
@jaumeBF_ @kaushikpatnaik Should definitely do both if possible.

But there are many benefits to on-shoring: hundreds of thousands of new good jobs, billions of $ demand for robotics, forcing function for specialized education, remove dependency on Taiwan for chip fab, and shore up cybersecurity.
The deep learning community never developed good tools for fine-tuning, but the game has already moved on. Now we need good tools for few- and zero-shot learning. Who's working on this?
@aviel Sure, Elizabeth and Sunny can do a good cop bad cop thing
The President should appoint a mean fucking person to re-patriate all tech. Broad mandate to use carrot and stick to get, for example, Apple chip, display, body fab and assembly on US soil. Get them to cut out the Ireland tax business. Break shit up if they don’t go along.
Great work from @MSFTResearch on using only synthetic data for a major computer vision application. Been fantasizing about this since grad school!

youtu.be/wXaVokqhHDk
@thesephist 💯

The metaphor we should be moving toward is "assistant."
e.g. "All swans are white" is disproved by a single black swan.
Annual reminder to the "believe science" folks: science cannot ever prove, only disprove.

No amount of evidence can conclusively prove a hypothesis.

A single piece of evidence is enough to disprove a hypothesis.
Nice
Media from tweet 1478416059321368576
Is there something like @OpenAI few-shot learning API for image inputs? I want to provide a few sample pairs of image input and text output, then provide only images and see model output.
Curious about "Robotic Process Automation," I looked up a @UiPath demo. Great tagline, but the gap between my workflows and what's shown is so large that the demo is still effectively meaningless to me.

youtube.com/watch?v=aOEBMM…
After 2028, many relinquish their US passports to stop supporting the stolen regime with their taxes. It's a brave new world come the '30s.
UK and France are the new standard bearers for the "western world," but the real action is in cross-national, citizenship-like memberships, which allow vetted professionals to freely live and work in a growing set of countries, including in Asia, no matter their passport country.
Housing prices in major US cities stop increasing.

2028 election structurally not winnable by anyone but Trump's appointed successor.

Majority-liberal states make efforts to separate from federal government, maybe form pacts with each other, many cases head to supreme court.
NewCo's match professionals to best visas and employer sponsors. More existing companies have to figure out how to pay international employees. Schools with international curriculums see enrollment booms.
US loss is someone else's gain. Canada, France, UK, etc all rapidly stand up new visa programs with relocation incentives. NewCo opportunities abound to help with this.

Airbnb prints money. NewCo's supply elite housing. Intl moving services are overwhelmed.
Trump 2024 wins in a scorched earth election that is widely considered stolen.

The US elite hates it. Since many now have remote jobs, cryptocurrency, and the entire world speaks English, they actually follow through on the age-old threat of "moving to Canada."

👇
@karldray Stuff like this for traditional finance costs between 0.3% and 0.6% of assets under management. That sounds about right for crypto, too.
Is there a Wealthfront / M1 Finance for crypto? I just want to say "40% BTC, 40% ETH, 10% SOL, 10% LUNA", send money, and have it automatically rebalance every quarter or something.
Big Monday vibes
Media from tweet 1470480599412523016
@Ben_Reinhardt @ArtirKel HP z27 works great for my mbpro: one cable from it to the monitor both displays and charges, and everything else is plugged into the monitor directly
Being risk averse is probably actually rational, given that you know the present but can't know the future?
Monopoly is such a weird word. Is it mono- or poly-?
Netflix show pitch: alternative history 2010's where the entire early Uber team is sentenced to prison. Using design thinking, agile methodology, and lean startup principles, they make their escape.
"...no one should be shocked when people who think about the world in unique ways you like also think about the world in unique ways you don’t like."

collaborativefund.com/blog/natural-m…
Surprised to learn the number of people who died from Covid in Seattle (all-time): 514
K-12 school where all STEM centers around creating the smartphone.

All of it: setting up agriculture, smelting metals, engineering buildings, chemistry, electricity, computing, etc.

"We need to etch these patterns on this crystal so it becomes a computer."
@nsvrana IMO we in fact should eliminate taxes up to the level where someone "has it all," and only tax consumption past that point
@A4lfr32 What's the evidence or theory for this statement?
@nsvrana def, start super low, ramp up super fast
Okay, I think I solved taxes.

1. Don't tax income. Why tax productive wealth creation?

2. Progressively tax consumption, including political donations (e.g. first car/house taxed 10%, second 20%, etc).

3. No diff between people and corps, short and long gains.
Idea: Figma for software development. Most complexity in modern software dev is not in a single file of code -- it's in the interconnections between modules, distributed systems, etc. Need IDE that reflects that. Need it to be online, multiplayer, and persistent by default.
Whoa!
Media from tweet 1458465227469574151
@kheimerl I'm for sure telling a simplified story. But do you agree that there's a difference between blue tribe and immigrant narratives in the US?
The second path, of reducing inequality between those in the knowledge economy and those in the service economy, is therefore our society's only option.

This can happen through taxation, sure. Could it also happen through higher service economy prices and thus worker wages?
I see two stable paths out of this: either we figure out how to boost everyone to be able to participate in the knowledge economy, or we reduce inequality between those who do participate and those who do not.

I don't think the first path is realistic on a reasonable time scale.
Now, to an immigrant who compares against poverty and strife, what you can achieve in the US with just hard work and no privilege is still a great deal.

But to a US native who compares to what their parents had, it's not enough any more.
The unfair privilege is having the combination of talent, resources, cultural beliefs, and opportunities to become educated for a high-paying career (e.g. medicine, law, engineering).

Yes, it definitely takes hard work. But that's not all it takes.
Therefore, restoring "hard work" as the story US tells itself about success should be a primary goal of those who want it to actually remain successful.

But a huge problem is that the modern knowledge economy only rewards hard work COMBINED with unfair privilege.
I think China believes in the story of "hard work," even if there are distortions at the edges (e.g. Party bosses).

Pitting "hard work" against "unfair privilege" as explanatory stories about your society, I'd bet on the "hard work" society every time, all else being equal.
(By the way, I'm not sure what US-born red tribe believes. They may well agree with the blue tribe that it's "unfair privilege" -- they just aren't as bothered by that. This would partially explain why they still can't attract US immigrants.)
Every society needs to explain to itself why some people have more than others.

Is it due to hard work? (US immigrants)

Unfair privilege? (US-born blue tribe)

Lack of scruples? (Russia)

Divine favor? (monarchies)

The answers become self-reinforcing until a breaking point.
Imagine the first guy to inflate the money supply. A competitive advantage over his peers
RT @weel: WHEREAS media routinely report the state and party a politician represents,

RESOLVED they should add in how many people they rep…
Cryptocurrency is great because I get to pay capital gains taxes on every single transaction!
Data Collection (2009)
Media from tweet 1456754034337980419
One of the most influential essays I've read.
Media from tweet 1456340749247713287
And I want to see average resting heart rate chart from Apple or Fitbit! Seems like we should be able to see prevalence of long-COVID in there.
RT @PottsJustin: Atlassian's new Sydney HQ is breathtaking. It's the tallest commercial hybrid timber tower in the world. https://t.co/z7xT…
I want to see an inflation chart published by mint.com
I think there's a rich vein of ideas somewhere in here. Something like "the opposite of no-code."

· What if writing emails was more like writing code?
· What if making a dinner reservation was more like writing code?
· What if booking a vacation was more like writing code? x.com/sergeykarayev/…
Oh, and:

It's not physically hard! Your shoulders hurt? Get up and stretch the f out!

Super highly paid!

Tailor the job to what you enjoy most: neckbeard code all day, or constant talk to other devs or users --- both are super valuable!
Software devs, we have SUCH a good job. I'm taking a little pause to be super grateful.

No set hours.
Work from anywhere.
Super fun flow state a good chunk of the day.
Insane impact leverage. Possible to improve millions of lives in several hundred or thousand hours.

Wild!
Very impressed by @asketsthlm being so transparent on pricing and sourcing of their garments. More of that please!
Media from tweet 1449896069219897348
Amen
Media from tweet 1448489622204342274
Three cheers for caffeine!
Okay so I fully believe that Tether is a scam. What are the implications if it were to fold and disappear entirely? x.com/BennettTomlin/…
@raycastapp saved me today! I accidentally copied something new without first pasting what I originally copied. Clipboard history to the rescue
Ah, the dawn of a new day. Time to program the world by writing emails.
@JacquesThibs Perhaps we’re all trolleys, doomed to both safely convey some to their destination and eviscerate others in our path.
@ikirigin Not arguing with that (although we could call into question whether a functional currency can be deflationary in the modern world)

Was just trying to point out some {contradictions | hypocrisies | revealed preferences}
1. Wake up to work remote from anywhere (in NYC)
2. Post about web3 (on Twitter)
3. Check BTC price (on Coinbase)
4. Order Sweetgreen (with fiat)
@AlinaVdav @mold_time @jeffnobbs The theory is that a healthy person would simply eat less if moving less, to maintain a target fat reserve level. I agree that not enough evidence is provided for that.
My conclusions re: @mold_time / @jeffnobbs / etc explorations of obesity & chronic disease epidemics:

- Something is breaking our hormonal signaling
- Processed food is certainly doing this, but unclear how
- Environmental pollutants (e.g. lithium, PFAS) are *also* doing this
Request for startup: Amazon, but for getting rid of stuff.

It’s super easy to get stuff into your home: just click Buy. It’s harder to get stuff out. Electronics should be recycled, valuable things should be sold, bulky things need transport. I’d pay to not worry about it.
These are 14-year olds from across the British society socioeconomic spectrum in 1970: youtube.com/watch?v=sghiwz…

Click around the video to see a number of them speak.

Is it just me, or are they in fact better spoken and reason more clearly than adults today?
Whole thread is well worth reading. If we are indeed being poisoned by something that is affecting our hormonal signaling and resulting in higher fat storage, and the effects began to show up in mid-1970's, I have another concern: intelligence. x.com/mold_time/stat…
Is there a service that helps tech workers get a raise?
Remote vs in-person for new companies:

· In-person is better early on. More energy, faster alignment and trust.
· Remote-first is better long-term. Hire globally best people, support diverse lifestyles.
· Switching from in-person to remote later will be difficult.
@TylerAlterman Should be reflected in google searches if true. Here’s one possible search term
Media from tweet 1441256815237287950
And recipe search results are atrocious! No one wants your life story and philosophy — just give us the ingredient list and list of steps to take. No one except Google, that is. They want your content to be loooong for some reason.
Google favoring lengthy content for search rankings has poisoned the public web.

This is the first organic search result for “concrete vs cement.” The answer is super short, but it just keeps repeating itself endlessly.

bobvila.com/articles/cemen…
I love the framing of "Jane Jacobs for the digital world."

@AaronHertzmann recommended Death and Life of Great American Cities a while ago, and it revealed the pattern of healthy urban spaces to me. Highly recommended to check out at least the first chapter or two. x.com/balajis/status…
I love colloquial explanations of things I don’t know enough about, like the classic ⁦@TheWarNerd⁩ writings.
Media from tweet 1436417131898302465
"Honor" sticks out to our modern ears, doesn't it? I couldn't think of a better word to mean "self-respect within your chosen morality and culture," which still feels like a huge factor in human behavior.
We tend to treat money as the only measure of wealth (e.g. "wealth tax"), but it isn't. Money is not even the best measure of wealth.

In fact, money is probably not even in the top three measures of wealth for most people.

What about health, relationships, honor?
Dude, the modern web is trash. Only useful parts of the site are highlighted in green.
Media from tweet 1435299165764947968
@JacquesThibs Yeah I got a little thing going, but wtf...

import os
from contextlib import contextmanager

@contextmanager
def cwd(path):
orig = os.getcwd()
os.chdir(path)
try:
yield
finally:
os.chdir(orig)
Does Python stdlib really not have a way to temporarily change working directory?

e.g.
with os.chdir(dirname):
do_whatever()
@wesamo__ It does have elements of this, but is very much on rails. I'm thinking an open-world game where you basically do science to figure out the rules of the simulation.
Video game where you are in a simulation and try to break out of it. By, like, gathering too many NPCs in one room or something.
How do I mute everyone with a punk as their avatar
Instead of climbing the career ladder, swing around the career jungle gym!
It’s pretty clear
Media from tweet 1433821085120667649
RT @vincentdonofrio: Pigs can't look up.
But I could pick a pig up one night and raise it into the sky and tilt this pig ever so gentle. I…
Y'all finally made me mute a word
Media from tweet 1432522484092268544
This, by Mao, is an even more metal speech than the 2nd inaugural.
Media from tweet 1432392302438219778
Post-War New World Map
Media from tweet 1431728936304578560
RT @dschorno: I wrote an essay about an extremely interesting theory of the obesity epidemic that I came across. Read a history of fad diet…
What a trip
Media from tweet 1431277363539570699
As long as I live, `len(x)` will remain far less natural than `x.length` or `x.size` or whatever
Excited to give @memdotai a go! Looks like it meets almost all my desiderata for a notes app, and their vision is exactly the right direction. Wish it had an iOS app already though!
There's totally a need for @Superhuman for calendars
Is it ethical for a prosecutor to push for longest sentence? What about a corporation pushing for lowest wage?

What about in the presence of a defense attorney, or a labor union?

Can be hard to see whether a component of a system is ethical without seeing the whole system.
Found the most no-nonsense landing page ever: @DeejoKnife

The knife model is zoomable and rotatable. Make your selections and click Order. 🤯C
Media from tweet 1429133720087261186
Saw this beauty the other day, but couldn't think what to do with it
Media from tweet 1428921806501539843
RT @anneapplebaum: This, from Sarah Chayes, author of a great book about corruption and long-time resident of Kandahar, is very good
https…
Interesting
Media from tweet 1428040096196878336
@visakanv Great thread! How are you able to find your own old tweets so quickly? Do you index them somewhere other than Twitter?
Open source enabled a lot of people to get rich building closed source software. But it also killed the market for better software tools. There used to be an industry of tool vendors offering compilers, libraries, editors, UI widgets. But you can’t compete with free. [Paraphrase] x.com/jonathoda/stat…
You either die a hero, or live long enough to become a boomer.
Insane that it was verboten to state the obvious for so long. Any reasonable person putting up their own money would bet that the pandemic began with the virus infecting a lab worker or escaping from the lab in another way. Doesn’t mean it was engineered.

nytimes.com/2021/06/25/opi…
Great in-person energy at @asugsvsummit in San Diego! Let me know if you’re attending and want to meet up
I remember when YouTube comments were a cesspool. But now they're heartfelt, wholesome, funny. On music, people pour out their emotions and are supported. On tutorials, people genuinely give thanks and offer their own tips. I love it.
RT @micsolana: the senate didn't pass an infrastructure bill, this is a trillion dollar paint job. it's still possible to do great things,…
Vibin with @D6MERIT’s message and efforts for gen z to look inward before looking outward. I just can’t imagine how it feels to grow up with social media from birth. #ASUGSV
Particularly intrigued by @MobalyticsHQ (repped by @Dr_Uthgar on the panel).

Training people to be better at gaming.

Or rather, training people to succeed in virtual worlds.
Great panel discussion on influencers, gaming, and new forms of educational media hosted by @meaganloyst at @asugsvsummit.

Thinking back to my HS years, gaming and trying to make game mods has been a huge part of my education, uncredited. And there was no youtube…
Media from tweet 1425220456471171075
Love the way this song starts with guitar on what you assume are the downbeats but turn out to be the offbeats, made clear when the rest of the band comes in.

Any more songs with this trick?

youtube.com/watch?v=wveUjX…
@kevgski Yeah @splice seems most promising but doesn't have the GitHub mentality of permissionless collaboration (at least I don't think it does)
Is there a GitHub for @Ableton (or Logic, or FL) sets?

A place where I can search and browse through other people's projects, work on them myself, and contribute suggested changes back?

Would also be a natural marketplace for VST plugin devs, mastering engineers, etc.
Our society is super excited about the future 😍t
Media from tweet 1424178829501734916
Face recognition:

- seems fine when done at low scale / high cost (like by patrolling police officers)

- seems awful when done at high scale / low cost (like by software running on cameras on every street corner)

Tons of stuff like that with technology.
@NotePlanApp @TimingApp That's one way for sure. Could also integrate with particular APIs, for example Google Drive, Slack, and Twitter, and pull in activity feeds from them. Wonder if you can pull browser history, too.
WhatsApp sure allocates a lot of UX real estate to Stickers, a feature I can only assume no one ever uses
@diffbot you have a bug in your sign-up flow: I'm entering the correct verification code, but it keeps getting rejected. Also, strange that [email protected] is not a valid email
4/ If I had a meeting, that should be logged and a note created for that meeting automatically. If I sent any tweets, they should show up. The list of all the channels I posted in on Slack should show up.
3/ I aim to consolidate personal notes in @NotePlanApp, but collaborating is still painful.

I also wish that my activities were automatically logged to NotePlan. For example, if I worked on a document in Drive, that should automatically show up in that day's note.
2/
· Research, idea, and conversation notes in NotePlan, Apple Notes, Notion, Bear, @FSNotesApp and random paper notebooks

· Old work stuff in Slack, Drive, Quip

· New work stuff in Slack, Coda
1/ I'm going crazy with all the places for files, notes, and collaboration!

· Code repos in ~/work and on Github

· Household files in Google Drive (but my wife uses iCloud)

· Daily tasks and notes in @NotePlanApp (but also Apple Notes and Reminders, and old todos in Things)
Like this idea
Media from tweet 1422242749877657629
Is Lincoln’s 2nd inaugural the most metal speech of all time or what?
Media from tweet 1422052686959026176
Gig economy for public sector jobs (e.g. picking up trash). Awesome idea or terrible idea?
Seattle talk. Voted the straight boomer ticket: Bruce Harrell, Ann Davison, Kate Martin (protest vote), Sara Nelson, Laura Rivera.
If the growing number of regulations is a problem for our society, as it seems to be (wtfhappenedin1971.com, etc), then a good thing to work on is reducing the number.

What are some technology ideas for doing this?

Some kind of analysis and assistance for a “refactoring?”
Media from tweet 1421641444347965440
RT @shreyas: If you’re a Startup trying to compete with a Megacorp—the 800-pound Gorilla in the space—you need to understand the tax inhere…
🤯7
Media from tweet 1421162574259752962
Excellent insights into how the most interesting techno producer thinks about music.

ra.co/features/3619
@dipamc178 Could lock up the equity for some time, so they can't "sell" for let's say 5 years.
Great ideas, but politicians are not incentivized.

But what if mayors had equity stake in their cities?

Instead of catering to interest groups for re-election, they could work as hard as possible to grow the city economy, and benefit from their equity even when out of office. x.com/sr_fuerte_/sta…
@vgr The Land of Fire & Smoke (entire West Coast)
Insane housing price increases in western Washington state! Seattle proper saw among the *lowest* increases, of like 25%.

How are people supposed to buy housing? Incomes certainly didn't go up 25%.
Media from tweet 1420791850894462978
Would be nice to write once, but publish twice (as both a blog post and a Twitter thread). I can imagine a cool AI-assisted writing interface for this.
Is there an app that simply shows you your liked tweets from a year (or another time) ago?
A very promising solution is described by @pnegahdar in the excellent post introducing the Synth browser: synth.app/blog/we-need-b…

I look forward to getting off the waiting list and trying it out!
A similar idea, although they do not call it a browser, is tryshift.com, led by @NadiaTatlow.

I have not tried it yet, but it seems to be the same set of features as in my tweet above, minus general browsing capability, plus easy account-switching.

5/N
One is meetsidekick.com, led by @d_pushkarev.

It lets you
- use web apps from the sidebar, not through tabs
- organize work into sessions
- use a global search across all your web apps

This browser is ready for use without any waitlist.

4/N
My friend @heyvrk recently joined @browsercompany; could they be working on something like that?

Couldn't find much info about what they do in the one article I read so far (protocol.com/browser-company), but it did point me to a couple of other cool next-gen browsers.

3/N
We can make a web browser that remembers, analyzes, and makes searchable everything you read and worked on.

Most apps are in-browser anyway, and you can use the browser across desktop and mobile, solving the problem with the menu bar solution.

2/N
I'd like to search across everything I've read recently!

One solution: menu bar app that remembers and makes searchable all that was visible on the screen.

Pros: works across all apps on desktop
Cons: doesn't work on mobile

Today I realized that another solution exists:

1/N x.com/patrickc/statu…
RT @patrickc: Who's doing exciting/ambitious desktop OS work? While mobile gets all the attention, we still spend a lot of time in front of…
Crazy how you can store labor and tangible wealth into intangible money. Like charging up a battery
I want @TheEconomist, but quarterly. Who has time to keep up with it weekly? And most week-to-week news is just noise. A quarterly ~60 pages on the state of the world feels about right.
Just tried @raycastapp and it seems a worthy replacement for Alfred!
Why have a strong position in the "space billionaires vs government" discourse?

State-run projects can clearly be amazing. But it does seem that they require a crisis (e.g war, depression) for success.

In the absence of a crisis, privately-run projects seem to work better.
Members of Congress should also be paid a lot better, but should definitely not be allowed to trade stocks.

They should also not be allowed to own any foreign stocks.

In fact, on their first day, they should convert all their equities into a broad US stocks/bonds index fund.
The role of US President should pay $10M per year at minimum.
Still shaking my head at the psy-op Netflix pulled to get itself into the acronym for the preeminent tech companies.

It clearly should be FAAAM: Facebook, Amazon, Alphabet, Apple, Microsoft. That’s like 22% of the S&P 500 right there. (Netflix is number 23 in this table.)
Media from tweet 1415124326060228612
In a communist economy, this pricing signal is completely extinguished.

Instead of letting society speak to itself through prices, a central committee tries to predict demand and thus compel production of every single item in the economy, with predictable results.
When this pricing signal is suppressed or distorted, our society suffers.

E.g. supply of doctors in the US is controlled by a cartel, so even though surgeons are paid very highly, society is not able to produce more of them and lower prices.
Pricing is how society talks to itself about which scarce resources it needs more of.

E.g. does society need more coders? Yes, they get paid so highly that it's a clear signal to young people to study, and adults to re-qualify. As more enter the field, the price will drop.
Why does communism fail?

One common explanation: it goes against human nature, which expects to be rewarded proportional to effort and risk.

But in an economically communist society, people could still compete for power, status, fame.

I think a better reason is prices (thread)
I wish there was a way to search across everything I’ve read in the last week/month/year.
I'll be starting something new soon enough, but for now you can find me:

· helping people ship ML projects at @full_stack_dl
· helping ed tech entrepreneurs at @gsvventures
· branching out of ed tech with my own angel investing
· posting bad memes right here on Twitter :)
At least there's no better person to hand off ML projects to than @Turnitin Director of AI @AmateurMathlete -- an exceptional leader, ML expert, and bike racer :)

AI is playing an increasingly important role in education, and I expect @Turnitin to be central to many of its uses.
I joined @gradescope as co-founder in 2014, and we were acquired by @Turnitin in 2018.

Good things end! Over the last few months, I've transitioned out of my role there, and now I'm swimming in open water again.

Much ❤️ to the people who have made it such an inspiring ride!👇
How wild is it that banks can legally lend more money than they have??
More than that! The calendar should tell me when to schedule meetings, when to do deep work, when to gym. Let me give feedback on each event, so that it can learn.

Going further, let me set status like "open to serendipity" and help other people know to call then. x.com/jmj/status/141…
Your scientists were so preoccupied with whether or not they could, they didn't stop to think if they should. x.com/AlexanderNL/st…
So, is there a search engine better than Google yet? Search results quality has been steadily declining.
Big 🤯 moment for me: on this one Yoga Girl podcast, every single woman interviewed had an answer to this question:

"What does your inner worst critic say, and how do your inner best friend reply?"

I don't think I have access to either one of those voices. Is that common?
@aerinykim No AC here either, but a nice breeze in the shade… definitely get Spanish-style siesta and going out at 10 pm for dinner.
Good-bye, hottest day ever! I enjoyed your every second 🌞🌞R5t
Media from tweet 1409729238471561219
One way to think about the levers of power in society is what you would want to take over first in a revolution.

Russian revolution 1917: banks, telegraph, railroads.

In 2021, you'd probably want to just cut the internet off entirely rather than take over google/facebook/aws.
It's remarkable that humans can survive being blinded. Can any other animals? I can't imagine a blind wolf or a blind deer surviving very long.

Recently saw somewhere that being able to survive a debilitating injury like that is a good definition of civilization.
It is a truth universally acknowledged, that an ML Scientist in possession of a notebook, must be in want of an ML Engineer.
I don't understand -- I've been investing in a balanced, diversified portfolio (70% bitcoin / 30% eth), yet my returns are negative!
Maybe the real friends was the treasure way the we made along?
Georgia is not only an outstanding computer vision researcher, but an outstanding corgi puppy holder! x.com/CVPR/status/14…
Media from tweet 1407153355969884164
RT @SeattleDataGuy: 3 Data Engineering And #ML Experts Share Their Thoughts on Where Data Is Headed And How To Raise $26 Million, by @Seatt…
It seems hard to split your time between multiple cities if you have kids, since school curriculums are not fully synced up. A nationwide (or glboal!) chain of private schools would make this easier.
RT @elidourado: In most countries, if they want college-educated people, they have to grow them from scratch and pay all the K-12 expenses,…
RT @gsvventures: Our portfolio company Gradescope was honored as Best STEM Solution for #HigherEd 2021!

Gradescope is co-founded by GSV A…
________ is all you need.

( ) Convolution
( ) Attention
( ) MLP-Mixer
(X) A single hidden layer (infinitely wide)
Media from tweet 1401567782668361735
So aliens can bend space-time and zip around in our air and oceans, but they can't put a little website up on our internet?
Signed up for a Calendly alternative that seems better, but for the life of me can't remember its name! Have looked it up a couple of times already, and just can't keep it in mind. So I just keep using Calendly...
The many faces of Jeff Donahue
Media from tweet 1399834508610838530
So, Global Entry can be thought of as an upgrade to US citizenship, right? They should unbundle it from the rest of US citizenship, charge a whole lot more, and sell it to everyone in the world.
He wafts his hat and the lunar dome of his skull passes palely under the lamps. His feet are light and nimble. He never sleeps. He says that he will never die. He dances in light and in shadow and he is a great favorite. He never sleeps, the judge. He says that he will never die.
Media from tweet 1398331081800048640
The "About Us" page that many startups have is like this, but:

- it tends to re-write history as people leave

- misrepresents contributions as new people join

- doesn't scale past ~30 people
This would improve:

1. Team morale -- celebrate shipping together, point friends and family at the credits

2. Individual morale -- you'll be able to prove your accomplishments to future employers

3. Hiring -- find people who worked similar projects, in the right role, etc. x.com/sergeykarayev/…
Okay so finger rings are “rings”, ear rings are “earrings”, but wrist rings are “bracelets”??
Just like we have the roll of credits for movies, we should have credits for software project features!
@Ben_Reinhardt Horses were domesticated ~4500BC, but saddles were not invented for another ~4000 years. And, while Romans used horses in chariots and with saddles, they didn't have stirrups, which took another ~1000 years to be widely used in Europe.
RT @Jim_Harper: "To get back to sustained growth, we will have to transcend the need for every policy to be chosen based on the worthiness…
RT @LeSalonDesArts: 📸- Windows of the World
by Andre Vicente Goncalves

1. Windows of Paris, FranceJ
Media from tweet 1393955298180796417
@JacquesThibs nice, that's like the primitive technology youtube channel
Idea: school that progresses kids through historical human technological development, so that they appreciate the motivation for what they learn. No running water till sophomore year!
The parable of the employer who randomly throws half the resumes into trash because “I don’t hire unlucky people” is surprisingly deep.
@visakanv Jeff Hawkins founded Palm Computing (remember the Palm Pilot?), then began studying neuroscience, starting the Redwood Center for Theoretical Neuroscience at Berkeley and then founding Numenta. Andrew Ng used to give out copies of his book On Intelligence to prospective PhDs.
You have to restart after running the font smoothing command before seeing a difference.
Setting up a new Macbook (M1, omg), and two things help it look a lot better:

1. System Preferences > Displays > 1280x800
This is a 2x multiple of native res. The default is 1.77x...

2. Turn off font smoothing:
defaults -currentHost write -g AppleFontSmoothing -int 0
There are games that are win-lose (team sports) and games that are win-win (business, art, science).

Two thoughts:
• Should teach children that the former are stupider than the latter.
• Should teach adults that they're doing it wrong if the latter are not win-win...
Rice is yummy just plain by itself.

Few understand this.
@rplevy @fchollet I sometimes think that way, and have found it elucidating to compare “consciousness” with “unconsciousness.” Sometimes you’re conscious of your heart beat, sometimes you’re not. Sleep, amnesia, anesthesia, etc
@etiennejcb @fchollet I think I understand where this is coming from — why aren’t we conscious of the whole universe, since all is one process?

But what do you mean by being? It seems very possible for a collection of beings (eg Internet) or part of a being (eg half your brain) to also be conscious.
@TheLeanAcademic I can see it that way too, but maybe the difference is that you could have got the restaurant experience, almost exactly the same as it is today, about a thousand years ago.
For $100, you can get a decent smartphone. The supply chain spans multiple continents, thousands of people were involved, and it's some of the most advanced technology on Earth.

For $100, you can also get steak, a couple of glasses of wine, and dessert at a restaurant.

WTF?
Two out of top three most valuable US companies are based in Seattle. But... it doesn't feel that way? Where are the lambos?

Apple - $2.2T
Microsoft - $1.9T
Amazon - $1.6T
Alphabet - $1.5T
Facebook - $.8T
Humans perceive the world largely through sight and sound. These give us info about the present.

In contrast, dogs perceive the world also through smell, which informs about the past.

Here is a glimpse of how sight could show the past. Would be a cool computer vision project!
Media from tweet 1387453602633981959
Happy Meme Monday!
Media from tweet 1386713736782639104
RT @euvieivanova: These are the most detailed images of a human cell to date, obtained by radiography, nuclear magnetic resonance and cryoe…
Oh no! It's another email from the Expensify CEO
Ah fuck, my Phone Number was found on the Dark Web
Media from tweet 1379945530625642498
Handwriting recognition is crucial to @gradescope AI-assisted grading. Last year, we upgraded our model architecture to ResNet + Transformer, led by @unterix.

On Gradescope test data, which has cross-outs, multiple regions, scientific symbols, and many things that make... 👇
So glad that almost every app I use now, both desktop and phone, has dark mode. Let yourself be enveloped in darkness.
RT @full_stack_dl: Live on our site: Week 8 of our Spring Full Stack Deep Learning course.

This week we cover arguably the most important…
RT @balajis: The review by @jasoncrawford is phenomenal, but the full book is really worth your time. Chock full of graphs and facts.

Pair…
New tools address these problems from different angles. Gmail has some auto-completion (and GPT-3-powered apps like @copy_ai take it further). @textio and others do correctness checks. @RoamResearch and others improve organization.

But is anyone building a true IDE for writing?
Writing is usually done on a plain old blank page.

- No autocompletion
- No "correctness checks" for whether your argument or story is coherent
- No organization of content into modules
- No way to "run the program" except ask someone to read your draft

3/4
What's very different is the creation environment.

Coding is usually done in a specialized editor that:
- autocompletes as you type
- checks for correctness
- organizes content into modules
- makes it easy to run the program and check its output

2/4
Writing is similar to coding.

Coding programs computers. Writing programs human minds.

Code can fail to compile. Writing can fail to make sense.

Ruby can translate to Java. English can translate to French.

Code imports packages. Writing references other work.

1/4
Okay, hear me out: we'll sell t-shirts, each with a unique combination of letters or numbers printed on it. People will buy them for a lot of money, because each one is unique.

Of course, to prove that it's actually unique, people will also need to buy an NFT, for more money.
Vibe: The night air is warm. You crack open a Fanta and sit down on the curb. This song is playing at a club further into town. It's 1998. youtube.com/watch?v=FQlAEi…
@AboutDev could you help ~300 learners of @full_stack_dl get some AWS credits, in part to be able to deploy their projects using Lambda? Please DM if so!
@amaarora @wightmanr What's worse about just changing the input dimensions to have 3 channels (`torch.repeat`)?
Shout out to the OG content creator
Media from tweet 1369347972475981824
RT @le_james94: In this lecture from @sergeykarayev, you'll get exposed to tools for:
- Writing proper DL code
- Provisioning compute
- Man…
Dialing in a new meme-heavy slide style 👷‍♂️W
Media from tweet 1369085550574006277
Meme crossover time
Media from tweet 1368745547289350145
Interesting to note that physical books feel like two dimensions--seeing and touching--whereas online articles feel like just one. Feels like the multi-modality helps with information retention. x.com/sergeykarayev/…
It's ffmpeg all the way down
Media from tweet 1367172769717252096
RT @RaoulGMI: Im doing a lot of thinking around currency debasement and how to measure it. You see, I think everyone looking for it to appe…
Best meme format
Media from tweet 1366919905036038146
Thanks for this @levelsio!

Would be awesome to also be able to adjust home prices by average wage (as well as S&P500).

My sense is that when adjusted by S&P500, housing, healthcare, and education are close to flat. When adjusted by avg wage, they’re increasingly unattainable. x.com/levelsio/statu…
Dimensions of engagement: seeing, hearing, speaking, moving, touching/manipulating.

You can map podcasts, books, phone calls, sports, video games, etc onto which of the above they require.

Most engaging: video games with voice chat.

Some combinations haven't been tried yet.
Remember, you can make sure that your argument is correct simply by repeating it a little louder.
AWS GPU instance prices. The A100 vs 32GB-V100 price difference is strangely small, given that A100 should be much faster than V100!
Media from tweet 1364634661943615493
Like code, human psychology has bugs. Unlike code, you can't eliminate them. Best you can do is avoid the bug-causing inputs.
RT @benthompson: The anti-nuclear movement is up there as one of the most destructive of all-time. The triumph of emotion over rationality,…
Are you really programming if you're not flipping individual bits on a hard drive with a tiny magnet, by hand?
Things that are impressive are always hard to do at the time that they are impressive.
Best almond butter of all time award goes to @grounduppdx, and second place is not even close. We're talking Standard Breakfast!
Media from tweet 1362465753878781953
RT @full_stack_dl: 1/🛠Tooling Tuesday🛠

Let's talk about an open source absolute unit: �@huggingfaceace provides open-source implementations…
RT @full_stack_dl: Live on our site: materials from week 3 of CS194-080: FSDL.

Continuing our deep learning review, this week we talk abou…
@niallmullensays @AboutDev @MikhailShilkov Thanks! Let's assume the highest setting available for Lambda (highest ram/cpu), and no work done in the function itself. All that changes is the size of the container image.
@niallmullensays @AboutDev Thanks, great to see!

> "Because the plain language model is already around 250 MB, the initial function run can take up to 25 seconds and may even exceed the maximum API timeout of 29 seconds."

Are you saying that cold start time is proportional to size of the container image?
@tomaspueyo Incredible map. Area heights exaggerated in some way? Seems almost like they’re on a non-linear scale of some sort.
@nickcammarata Do you know if there’s research into its prevalence?
@nickcammarata I assume the term means self-talk with an emotional valence?

(Which I became aware of as something that people had when someone posed a question like “What does your inner best friend say, and what does your inner enemy say?”)
My goal as a manager is to make my direct reports cry (tears of joy).
There will be a new planet-wide "country" that people can join using an app. They pay "taxes" to it in return for some benefit, such as visa agreements with other countries.
A little snippet into what our laws look like. How do you feel when you see stuff like this?
Media from tweet 1360278062596820996
Always astonishing to see these two charts. No wonder we can't build anything anymore.
Media from tweet 1359967958496378886Media from tweet 1359967958496378886
@DescriptApp is absolutely magical to use! Currently the best example of AI-first product UX.
A better way to phrase corporate cultural values like “Nothing is someone else’s problem” is “If you solve other people’s problems, you will succeed at this company.”
@fulhack Definitely in browser (or in JS app like VSCode connected to the cloud). Such a pain setting up all the dependencies and keeping them in sync with changes, etc.
We should expect adults to "grow" as much as children do. The phrase "grow up" as if into a final state is wrong.
RT @full_stack_dl: FSDL week 2!

Convolutional neural networks and computer vision started the deep learning revolution, so our course star…
Media from tweet 1358574753670328321
There are also like three different radar detector ads.

Overall, August 1987 seems like a fine month to have been born in.
There are like five more, but let’s skip ahead to the worst one: Vantage.

What is this emphasis on “performance”? Why would I call a 1-800 number to get a single free cigarette? Why remind me of tar? Hard pass.
Media from tweet 1358481646425579520
In fourth place, Newport. Is it a seaside vacation town? Is it a cigarette brand? Why are the fonts inconsistent? Doesn’t matter. Newport.
Media from tweet 1358481638666117121
Coming in third, the apparent yuppie choice: Benson & Hedges.

Not totally sure what’s going on, but I’m into it.
Media from tweet 1358481631699300354
Second place goes to Camel.

Similar vibes as Marlboro, but for more of a lone wolf type? Brand a little less strong. Guess that’s why they have to show what their pack looks like.
Media from tweet 1358481625143681024
On a whim, I bought a Playboy magazine issue from August 1987, my birth month.

SO MANY cigarette ads in it! Might as well rank them.

In first place, Marlboro. Makes me not only want to start smoking but move to Montana and become a cowboy. Now that’s good branding.
Media from tweet 1358481617233141760
Rephrased: let's incentivize investors to improve strangers' educational outcomes.

Colleges too often saddle people with nondischargeable debt for worthless degrees.

Investors should identify the best nursing, coding, trades programs and make $$$ funneling people through them. x.com/sergeykarayev/…
It should be way easier to invest directly in people.

Let's say someone wants to switch careers, and needs 6 months of training.

It should be more easily possible for them to convince individual investors to cover expenses in exchange for share of increased earnings.
A couple of big boys perched up on this tree behind our house today. Good omen?!
Media from tweet 1357175886919856130
@GoAbiAryan Yeah, I can imagine kids having a choice between

(A) formulating their own project vision and convincing others to join them (being a lead), or

(B) joining a project they like (but they must be selected by its lead)
@CharlieYouAI Right, it's not what the at-will employment world is like. (Maybe the military is like this, though.)
The image below is 100% relatable, but I think group projects should be a much *larger* part of education.

Except: each group should have a leader, with ability to "fire" others, and members should be free to leave and start their own groups.
Media from tweet 1357039709294264322
3/ He spoke of feeling like he found a hole in the fabric of reality.

A portal where if he stepped through, he wouldn't have to continue believing that he was Jim.

What a trip to realize that your own identity is only fictional reality, and you can just stop believing in it.
2/ Importantly, fictional reality entities are just as real as tomatoes -- but only as long as people believe in them.

I was reminded of this when watching the documentary Jim & Andy, where Jim Carrey essentially lost himself while portraying Andy Kaufman.
Media from tweet 1356684983046348800
1/ @harari_yuval makes a tremendously useful distinction of a third type of reality: Fictional Reality.

Objective Reality: tomatoes, sunlight, death

Subjective Reality: feeling pain

Inter-subjective (or Fictional) Reality: money, human rights, the United States
Excited about my friend @bictolia -- experienced engineer, manager, and lecturer -- experimenting with new modes of learning online! If interested, fill out the form at everybodystudy.club!
RT @andy_matuschak: No one's yet made a workable solution for web micropayments, but one aspirational design metaphor I like is an electric…
Search Yelp for primary care and you'll see that awful experiences are the norm, allowing things like this to happen.

The market is somehow prevented from producing good patient-doctor experiences -- is it the central planning of the # of doctors by the AAMC? x.com/olgakhazan/sta…
Guys, would you mind pumpin' those stonks a little more quietly? I'm trying to take a nap here.
AI *research* community: alternatives to the scaling hypothesis

AI *product* community: taking responsibility for your model's behavior

AI * thought leader* community: growing your newsletter audience 😅x.com/AndrewYNg/stat…P
For most of history, people lived their lives vertically, with reference to a Heaven above and a Hell below.

Now we live our lives horizontally, with reference to a past that we can repair or extend, and to future generations for whom we may make a better life.

- @adamgopnik
@parthopdas I think you point to one thing one can do after adopting this view: training oneself out of acting impulsively when emotionally affected, and instead waiting until free will replenishes a bit :)
Free will is not binary, it's a spectrum.

When your loved one's life is in danger, you have no free will. When you're picking out a wall color, you have a lot of free will.

In daily life, "free will" is like a limited resource. There's less of it when you're hungry, angry, etc.
Incredible. I had no idea that immigrants from Africa and South Asia were trying to get to the United States from Colombia by crossing a jungle on foot! x.com/nadjadrost/sta…
@Austen It's just incredible that this is who's covering technology.
@gwern Since "better" is a morally defined term, I agree with what you say if you replace "better" with "wealthier."
@minney_cat @Benioff The business model innovation you describe was shifting from buy-a-unit to buy-a-subscription.

The next business model innovation may well be shifting from buy-a-subscription to buy-a-unit. Frictionless per-unit payments?

Bundling and unbundling being the only two ways, etc.😬
Awesome writeup of making remote teaching more engaging and ✨FUN✨

Super inspirational for us at the Berkeley Full Stack Deep Learning course, which began yesterday (bit.ly/berkeleyfsdl)!

Remote interaction is still in the super-early stage. Zoom is not it long-term. x.com/kayvonf/status…
@kimmytaylor For what it’s worth, a study by @whynotyet found that learners prefer seeing the speaker’s face in the corner, but that it has no effect on info retention.
Media from tweet 1349757095906721793
RT @terronk: Project management for data science often tries to borrow from software engineering practices. Agile, estimation, sprints.

Bu…
Just had a premonition of @trvisXX releasing a track on the blockchain
Caring is a superpower.

But not caring is also a superpower.
@V0R0N01 What if we had an FDA but also a Virginia DA, California DA, etc. CDA could approve something for CA residents, but many CA residents could still choose to follow FDA rules instead.

Just trying to think how we can fix some bottlenecks Covid has exposed...
Could a US state have bypassed the FDA and start vaccinating volunteers back in March?

Or quickly run their own challenge trial (intentional virus exposure)?

If not, then why not? If yes, then why did no state do this?
(3/3) Seems impossible to participate in a modern information economy with 0% of your graduates able to solve problems like this.

And can you imagine how much easier it would be for a country to have 20% rather than 2% graduates operating at this level?
(2/3) Only 2% of US high schoolers are able to do math at this level.

4M people graduate per year in US, so only ~80K are at this level.

OECD average is 3%. Singapore is at 18%, Japan at 8%, Germany at 5%, lots of countries at 0%.

oecd.org/pisa/test-2012…
(1/3) PISA sample highest-level math Q:

"Helen traveled 4 km in 9 minutes. Then she traveled 3 km in 6 minutes. What was Helen's avg speed, in km/h? ___"

Want to guess % of US 16-yr-olds that answer correctly?
Guess people know how to buy now
Media from tweet 1345521789674262528Media from tweet 1345521789674262528
Wouldn't it be fun to take a high school course where you produce media in a bunch of different forms:

- Sonnet
- Stage play
- Short story
- Factual article
- Opinion essay
- Slide deck
- Youtube video
- TikTok video

Is that what the kids are doing??
how it started: how it's going:
Media from tweet 1344704571042500610Media from tweet 1344704571042500610
How do I get Jon Ossof to stop sending me emails
The Hound's Tooth

- 1 part Japanese gin
- 1 part Disaronno liqueur
- 1 part lemon juice
- Garnish with a lemon twist

Maj. Thomas Snow would order his favorite cocktail before dinner at the Burmese Officers Club in the 30's. Best with peanut and tamarind sauce dishes. x.com/sergeykarayev/…
I guess the "extraction" can refer to tech workers "consuming" city coolness but not contributing coolness back, because they and their patagonia vests are so lame. Or perhaps to South Bay companies not paying taxes in SF where a lot of their workers live (and do pay income tax). x.com/sergeykarayev/…
Christmas Idea: Cocktail review website where the history of each cocktail is totally fake and some cocktails are just made up.
Merry Christmas!
The point of investing in a diversified portfolio of stocks and bonds is to own the entire economy instead of trying to pick winners.

But the public stock market is far from the entire economy, right?

Does a VTSAX/VBTLX portfolio include small businesses or startups in any way?
@AboutDev @awscloud Thanks for the correction Sushant! Now please release this in Canada and Australia so that I can actually use it :)
Thinking about live-streaming petting my dog
Great passage from Tender is the Night by F. Scott Fitzgerald
Media from tweet 1336770271919833088
@JacquesThibs Interesting to see France rank higher than US! Anecdotally, SF and NYC are full of ambitious young French people who move to US for the early part of their career. Haven’t heard of anyone from the US move to France like that.
Some amazing @awscloud Lambda announcements this week for ML deployments:

- You can now deploy Docker containers, up to 10GB size! So excited to stop hacking OpenCV and Tensorflow/Pytorch to fit into a 250MB zip package.

- RAM limit is now 8GB, with 6 vCPUS

- AVX2 support 🏎
@antoniogm People with a line of ancestors who were wealthy-enough often don't seem to understand that wealth (and peace) can be created and destroyed.
Seems that Amazon has a negative impact on brick-and-mortar businesses, but a positive impact on online businesses and small manufacturers. Is there a good analysis of how the two balance?
AND: it doubles as a diary of a sort. I've been using NotePlan consistently since May 2018, so I can tell you what I did on any workday since then.
NotePlan is the only todo-list software that has made sense to me.

- automatically get a page per day
- it's all Markdown and the text editor is decent
- you can use non-dated notes if you want to organize future tasks
- and now, your calendar is right there in the sidebar

👍x.com/NotePlanApp/st…s
This past Spring, I was fortunate to teach Full Stack Deep Learning materials to a university audience at UW.

Next Spring, I will be working with Josh and Pieter to teach it at Berkeley!

There are many updates, so if you're interested in following along, check out this thread! x.com/full_stack_dl/…
Idea: camera on a robotic tripod that can track your face, follow you around, and take basic directions. So you can prance around while on a video call but the video will be of just your face :)

I guess these things kind of exist? wiki.ezvid.com/best-robotic-c…
We could bring back togas while we're at it
Media from tweet 1327295149718659074
Instead of professional politicians, we should have year-long appointments for random citizens, just like we do with juries.

If it's a good enough system for deciding life-or-death, it's a good enough system for approving a budget.

This thought brought to you by @SeattleCouncil
“It is the chase that heats up the great mob. And the fact that the chase is unjust only tickles them the more, for to do injustice with impunity is a sign of power, and power is the thing that the inferior man craves most violently.” - HL Mencken
Shout out CS beta 6.1 cs_militia, what a map!
Media from tweet 1323710241289072640
Who are the samurai of our society?
RT @michelletandler: This is a must-read.

Vivid depiction of the drug encampment situation unfolding on our streets.

+1 that our city's…
Shout out to Black & White, Game of the Year 2001
Media from tweet 1317214327405834241
ML question: I want a classifier to predict only when confident.

I can train with usual loss, then find conf. threshold that maximizes some metric.

But it feels like I should use a better loss, to allow for "abstaining."

I found openreview.net/forum?id=rJxF7…, is there anything else?
@kheimerl @ArminSamii I was thinking more in the psychological sense, like an object that makes you the opposite of anonymous
Word request: the opposite of a mask
At night, said Tobin, when the horses are grazing and the company is asleep, who hears them grazing?

Don't nobody hear them if they’re asleep.

Aye. And if they cease their grazing who is it that wakes?

Every man.

Aye, said the expriest. Every man.
@wmhaddad @BeeSimulator Yeah, my dog always tries to eat bees. It's a very small dog by human standards, but we can only imagine how huge she is to a bee
Media from tweet 1314256414299230209
Game Idea: you are a bee! You fly around incredibly beautiful flowers (bees see UV light), gather different types of nectar, avoid huge dogs trying to eat you, super scary wasps, etc.

HOLD UP: it exists! @BeeSimulator about to deliver on this vision!🌼yeah!
You know how they would drown a suspected witch to see if they were in fact a witch? There should be a word for solutions like that
RT @Pinboard: Here's a list of 49 state house races where your money, right now, can make a difference in the November election: https://t.…
Game idea: platformer where you're one of those peeing boys, and you solve puzzles by peeing into buckets and onto people and stuff
Idea: digital globe. You know, like a real globe but it can show all kinds of maps.
Mood
Media from tweet 1310982695049076737
Art idea: Roman style busts but of contemporary people, like Paris Hilton or Steve Buscemi
Free startup idea: Zoom, but for dreams
Idea: game where you play as a fly and try to influence the course of events in a human household. Like, prevent a fight, lead a person somewhere, etc.
"A thousand half-loves must be forsaken to take one whole heart home."
RT @asugsvsummit: The Summit is now virtual and FREE to all in order to create access to #education & #tech for all.

We’re focused on pro…
RT @Pinboard: I'm super happy to announce we have a $10,000 matching challenge for the Great Slate—five rural House campaigns bringing in e…
These beliefs are implicit and unexamined.

In fact, upon reflection they would agree that wealth is not static, and that peace is not a given.

But they don't feel this deeply in their core, as do all of those who uproot their families to seek an honest life on foreign shores.
I now notice a fundamental worldview difference between the average native-born American and the average immigrant.

The native-born American seems to believe that wealth cannot be created or destroyed, and that peaceful society cannot come undone.

theatlantic.com/ideas/archive/…
Media from tweet 1301210331570634753
“The nature of rain is the same, but it makes thorns grow in the marshes and flowers in the gardens.” - Anthony De Mello
@oscarbatori Good point, maybe related to trend of people moving out of urban cores.
RT @anafabrega11: School:
- Here’s what you’ll learn
- Here’s what you’ll use to learn that
- Here’s how you’ll show what you learned

What…
Moneyland: Why Thieves and Crooks Now Rule the World by @OliverBullough
Media from tweet 1298665179740164096
But in the minds of many, I would guess most, of people who will see this clip, the bullies themselves look like Nazi brownshirts in Germany circa 1933.

How to get out of this?
This makes me mostly sad, a little angry. What can normal people do to turn the tide at this point?

We’ll have to start with empathy. I suppose that in the minds of the screaming bullies, the US today is basically Germany circa 1937, and peaceful diners are Nazi collaborators. x.com/KunkleFredrick…
@coldxmann: anti-racism should be "grounded in the idea that there is a single human race to which we all belong—and that all the ways of dividing us up, though they may be important to understand our present reality, should not be given moral weight"
persuasion.community/p/a-better-ant…M
@astrosaeed This data is from the United States. HS means "high school," which is the highest mandatory educational level. BA means Bachelor's, roughly four more years of education (also called "college," "university," "higher education."
This is one of the most striking plots I’ve ever seen
Media from tweet 1295060547562397696
RT @malicannoyan: I just completed the @full_stack_dl.

There're many courses on ML, but few on production. So far I learned production fro…
Kids have the neighborhood ice cream truck. Adults should have a 1968 El Camino driving around the neighborhood, blasting Creedence Clearwater and tossing out ice-cold cans of Coors!
I am lying on the grass and pointing a laser beam up into the sky. It is night time. The air is clear. I don’t think the beam has an end, until it hits a star.

I am balancing a star on a beam of light, lying on the grass in the clear night air. That’s enough to spin your head.
Aggregating and processing data from disparate sources to get it ready for training on the GPU can be a challenge.

@ApacheAirflow is great for flexible Python pipelines.

But: keep it simple as long as possible. Can simple parallelization do the job for now?
Media from tweet 1285628511181516804
Versioning

To reproduce model training, data must be versioned.

Level 1: just store a snapshot at training time
Level 2: use version control (git-lfs)
Level 3: use specialized tools like @Liquidata1 or @DVCorg.

Get to Level 2 ASAP; move to Level 3 when you can explain why.
Media from tweet 1285628508249763842
Storage

The building blocks:
- Filesystem
- Object storage (e.g. S3)
- Database
- Data lake

Put the right things in the right place! Example: images in S3, labels in a DB, and processed, ready-to-train data in the local filesystem. Learn from @intensivedata!
Media from tweet 1285628504629968902
Labeling

Spend a day labeling data yourself to see enough edge cases and write clear instructions.

Then, outsource to company like @FigureEightComm, or at least use purpose-built tools like prodi.gy from @explosion_ai with your own annotators from Upwork.
Media from tweet 1285628501849264128
Sources

Defensible AI product == proprietary data.

Typically, have to spend $$$ to label. But there are three judo moves:

1. Semi-supervision: label data with itself

2. Augmentation: mess up your data, aggressively

3. Synthetic data: generate your own!
Media from tweet 1285628498758045698
Here's something all ML practitioners know: data is AT LEAST half the job. (h/t @kscottz @mat_kelcey @vboykis)

In my lecture for @full_stack_dl, I break down data management into Sources, Labeling, Storage, Versioning, and Processing.

Thread 👇C
Media from tweet 1285628494316281856
RT @carranzadanielh: This is one of the greatest courses in Machine Learning, taught by incredible people!
If you want to be a great ML Eng…
RT @josh_tobin_: What makes production ML hard?
- Cleaning, labeling, and augmenting data
- Troubleshooting training and ensuring reproduci…
@ikirigin Closest we've found is productboard.com. Not endorsing, since we haven't tried it, but the only app that checks your boxes I'm aware of.
RT @scheplick: I often forget how young the founders of the United States were.

Thomas Jefferson 32
George Washington 43
John Hancock 39
J…
RT @coldxman: In @CityJournal, I sum up my view of the BLM movement: they're right about several important things, but wrong about the alle…
Just submitted final grades for @UW Full Stack Deep Learning. Had a great time teaching 80+ motivated @uwcse professional master's program students! Materials online at bit.ly/uwfsdl
RT @Suhail: Increasing the surface area of a product is the most subtle way companies move slower. That’s why it’s so important to be clear…
I would add point 15: American police have become overly militaristic and antagonistic toward the people they serve. We are not "civilians" we are their employers. x.com/HeatherEHeying…
“The law, in its majestic equality, forbids rich and poor alike to sleep under bridges, to beg in the streets, and to steal their bread.” - Anatole France
Murder mystery but set in caveperson times 😮
"...we still amassed a net increase of 145,000 people over the course of the decade. That adds up to a remarkable 23.8% growth rate. And with that, it’s official: Seattle ranks as the fastest-growing major city of the 2010s." - seattletimes.com/seattle-news/d…
Media from tweet 1263581078255628288
@Suhail Here's the deal: Google is now Alphabet, Netflix just makes TV shows, and Microsoft is #1 most valuable company in the world. It's FAAAM, fam!
RT @DeepMind: Want a hands on approach to bridging the gap between training machine learning models & deploying AI systems in the real worl…
RT @BretWeinstein: It is insane that, as the evidence for Vitamin-D's protective effect mounts, and though it appears the virus is very rar…
@AaronHertzmann Thanks for the link! Huge difference between immediate donation and charitable LLC. Debating individual charity vs higher taxes is totally valid. I just don't like article's presumption that everyone is a fraud and nothing can be as good as giving SF more tax $...
@AaronHertzmann Not sure it's possible to meaningfully deploy $1B in less than a few years... I **kinda** get the worry about charitable LLCs (but not really, that money is def committed to charity)? But the overall tone of that article is exactly what's repulsive about a lot of tech journalism
Seeing lots of opinions about how real estate prices will behave over the next months and years. Is there something like a futures market for real estate, so I can see what people are actually betting their 💰on?
Jack Dorsey's low key $1B donation (a third of his paper wealth) went by kinda underappreciated, right? t.co/bG8dk6On7t
@saams4u Yeah, a healthy ecosystem needs to be in balance. If one species gets too good at surviving and competing, the whole thing can collapse. Viewing society as an ecosystem, are laws are our rules for survival and competition.
@AmateurMathlete Bad things! Maybe in my analogy of an ecosystem, that'd be like humans getting way too powerful, killing all animals except for chickens and cows, and planting only corn and wheat...
the same pest problems of any monoculture. A capitalist should want to get way too rich, and a public servant should want to put him in jail for insider trading. What do y'all think? 2/2
Thought: a healthy society is formed by people with different moral inclinations, like a healthy forest is formed by plants, bugs, deer, wolves, etc. If everyone shared the same moral desires, we'd be a monoculture corn field, and subject to 1/2
RT @tomaspueyo: This chart shows how much of a no-brainer masks are. They *could* single-handedly stop the epidemic.
Media from tweet 1253348604670095360
RT @balajis: Most studies of COVID-19 treat each patient as a row in a dataset. It's another thing to hear from them yourself.

The https:/…
RT @mattparlmer: Still stand by this. We need to start putting anybody who can't WFH in appropriate PPE and back to work. Even retail and r…
I ❤️ this. The federal government should matter much less than it does. In our current crisis, all it did (and seemingly is still doing) is put up roadblocks to an appropriate response. Happy to be represented by @GovInslee, happy to see more governors on the national stage. x.com/GovBobFerguson…
Incredibly striking graph of the Obama era. Did not realize that this happened. This article is full of these nytimes.com/interactive/20…
Media from tweet 1248669240992739328
RT @MacaesBruno: How do these people manage to come back on air? I would be too embarrassed
Media from tweet 1246189852325900288
RT @KyleTibbitts: @naval The FDA would rather health workers wear no mask than one they haven’t approved. They should remain deregulated—FD…
RT @BillAckman: @realDonaldTrump Mr. President, why don’t you launch the biggest infrastructure program of all time now? Roads, bridges, an…
RT @webdevMason: "Masks don't work for a highly infectious respiratory illness, but also please save them for health workers so they don't…
RT @balajis: Growing consensus for total rethink of bio regs. Start with expanded right-to-try.

Because FDA failed. Test delay turned cont…
RT @DellAnnaLuca: If the virus were in charge of the government, it would:
- heavily regulate test kits production
- make sure that people…
RT @USArmy: Lt. Gen. Todd Semonite, Chief of the @USACEHQ, provides a 'simple' solution to the complicated problem of building temporary me…
RT @nntaleb: Explain to me why we should spent taxpayer money to bailout companies (airlines) who spent their cash buying their own stock s…
RT @EricLiptonNYT: This is really shocking. We live in a nation of such wealth. And we have effectively run out of such a basic protective…
Prices seem to be about $1 per mask of either type, with surgical masks a little cheaper, for an order of about 10K masks. Not yet sure about shipping rates and speed.
While it seems that respirators would be better, it seems that in practice surgical masks are equally effective for preventing infection of the wearer: smartairfilters.com/en/blog/n95-ma…
The first distinction is between surgical masks and respirators. Seems that the former are rated for *exhalation* filtering, the latter for *inhalation* filtering. Some helpful info: smartairfilters.com/en/blog/compar…
It's embarrassing that US hospitals have face mask shortages. Spent some time looking into possibility of ordering from China to help out. Here are some links so far:
@datarade One of the fanciest restaurants in Seattle was ahead of the curve: closed its dining room before mandated, and announced well-branded drive-through options: canlis.com
RT @jburnmurdoch: The reason I make this chart is to get across the inevitability of coronavirus:

All western countries are on the same tr…
RT @MaxCRoser: 1/ Many of you ask me why I take the COVID-19 outbreak so seriously.

Current numbers of cases and deaths are *not* why.

👇…
RT @Farzad_MD: 1/ I'm very worried that we don't have a clear strategy for #COVID19 response

We need to clearly define when the public hea…
RT @sethbannon: American Hospital Association "Best Guess Epidemiology" for #codiv19 over next 2 months:

96,000,000 infections
4,800,000 h…
RT @biden4pres: Let’s get me elected, just for the hell of it
RT @Pinboard: It's Friday night in America. If you're feeling overwhelmed by this week's political and national news, there's no shame in d…
Really glad there's not going to be a 100K person gathering in Seattle next week! SXSW canceled too. x.com/emeraldcitycon…
RT @le_james94: bit.ly/2IsrI5k

Deploying ML models into production is a complex affair. In this post, I'd like to share the best…
😮“Like oysters forming lustrous pearls around irritating grains of sand, overburdened TAs and faculty are seeking succor in algorithms, artificial intelligence, and machine learning to automate the tedious job of grading.”asee-prism.org/the-drudge-ret…0 thanks@MaryLordDCC
RT @bradneuberg: Sir Isaac Newton wrote his ground breaking Principia while fleeing the plague, waiting it out at his family farm while his…
"It is easier to protect your feet with slippers than to carpet the whole of the earth." - Anthony de Mello
RT @dril: the wise man bowed his head solemnly and spoke: "theres actually zero difference between good & bad things. you imbecile. you fuc…
RT @oren_cass: 13/ COTI shows that while the nominal median male wage rose from $443 to $1,026 from 1985 to 2018 (132%), the expected cost…
RT @karldray: @sergeykarayev Start your Bernie Journey or head over to Pete Street or put on your Klobe Robe or get into the Liz Biz
A narrative seems to be settling in that Trump has been actively good for the economy. Much more fair to say that Trump has been merely not bad. And that can be said only if you ignore the quickly growing deficit. @davidfrum on that and more in theatlantic.com/ideas/archive/…
Media from tweet 1225169745571459072
Great article by @ballmatthew! The question for me is how video games can be a communal experience not just for the player but for people sitting next to them, in the same way that TV/movies are. matthewball.vc/all/7reasonsga…
“First we shape our tools, thereafter they shape us.” - Marshall McLuhan
Has anyone seen a map of the United States with each precinct or county colored by the average number of generations from first immigrant?
What's up with people still saying FAANG? Google is now Alphabet, Netflix just makes TV shows, and Microsoft is the 2nd most valuable company in the world! It's FAAAM, fam!
@minrk d) do it any old way and wait for shell heck to yell at you how to do it properly
RT @zoeschlanger: Morning, today I published a story on the insane fact that only 9% of all plastic produced gets recycled, and the vast ma…
RT @xsteenbrugge: Emtremely excited to finally share a side-project I've been working on for the past few months: Neural Synesthesia, visua…
Idea: weather forecasts are sometimes high-confidence for 10 days, sometimes only for 2 days, and sometimes not even for the next day. The forecast UX should reflect the level of confidence visually
This guy chops!
Media from tweet 1188160897585307648
RT @superamit: Google Calendar feature suggestion: Blind Cancel.

If you blind cancel a meeting it sends no notifications and the meeting i…
Haven't seen this before: adversarial noise in the recaptcha images!
Media from tweet 1182724656551251974
RT @ch402: Do you formally know Monte-Carlo and TD learning, but don't intuitively understand the difference? This is for you.

https://t.c…
RT @pabbeel: Want to take your Deep Learning to next level w/real-world deployment in mind?

Excited to announce Edition 3 of the Full St…
I really like this blog post format of photos interspersed with text, like a friend showing you photos from their trip and explaining why they're interesting granolashotgun.com/2019/06/27/the…
RT @kevgski: The Gradescope Team is looking for someone to help lead our engineering efforts. If you are interested in education and want t…
RT @josh_tobin_: 1/ Figuring out whether your deep learning project idea is feasible is hard. Some ideas on how to think about it:
Here's a 5-min nibble of full stack deep learning for you 🤓 Infrastructure Landscape for Full Stack Deep Learning from our March 2019 bootcampyoutu.be/a33nwW4-rn8l via@YouTubee
RT @levie: Apple has cemented a turning point for the future of the web. In building products, the question is moving from “how much data c…
@TaraChk⁩ summarizing the excellent AI in Education sessions at #AIforGood. There’s so much to do in the space!
Media from tweet 1134080033142632448
RT @Turnitin: We’re excited to participate in the #UN AI For Good Summit this year! Our Gradescope co-founder @sergeykarayev will be speaki…
RT @planarrowspace: The Testing & Deployment talk with @sergeykarayev at the Full Stack Deep Learning bootcamp is an eventful two hours: ht…
Very honored to receive the Early Career Diamond Award from UW Engineering! All thanks to my co-founders Arjun, @pabbeel, Ibrahim, and the whole Gradescope crew, our investors, my research advisor @trevor_darrell_, many CS professors a UC and UW, my parents, and my fiancé Nico!
Media from tweet 1129823490364923905Media from tweet 1129823490364923905
RT @pabbeel: The Full Stack Deep Learning Bootcamp was a lot of fun in person, but of course not everyone can make it in person. Very excit…
RT @BillGates: If you believe innovation is for everyone, then I have a list for you: 10 challenges that the world needs your ideas to help…
Excited to announce that Turnitin has found a permanent home with Advance, a conglomerate that owns Condé Nast, Discovery Channel, and Reddit :) wsj.com/articles/advan…
RT @vboykis: New blog post: For the past couple years, I've been telling people who ask me for advice not to go into data science. Here's w…
RT @Smerity: Before doing anything intelligent with "AI", do the unintelligent version fast and at scale.
At worst you understand the limit…
RT @roybahat: SF has gone from startup hub, to startup headquarters-and-maybe-engineering hub, and risks soon becoming just a startup fundr…
RT @kscottz: At this point I consider my job to be 95% building tools to scrape, mine, move, annotate, review, and preprocess data sets, an…
Very impressed with the @cs50 office and software such as sandbox.cs50.io. Thanks @davidjmalan for the tour! It's interesting and frankly inspirational to view college courses as multi-media software companies that "ship" both online and physically.
RT @josh_tobin_: 1/ Excited to share something I've been working on for a while now. Troubleshooting Deep Neural Networks: a decision tree…
Hosting last summer’s bootcamp was incredibly fun and rewarding, thanks to the smart, enthusiastic attendees. Looking forward to two more this March! x.com/pabbeel/status…
RT @full_stack_dl: Meeting enthusiastic students and engineers at the last Full Stack Deep Learning bootcamp was an amazing experience. Sta…
RT @AustenAllred: New experiment from Lambda School, coming 2019!

* Housing in downtown Salt Lake City for 1 yr

* $500/month food/living…
@scheidegger @ramhiser @pwang Carlos, agree with you re: multiple choice! We actually built @gradescope exactly so that we could grade free-response questions at scale -- it's not for multiple choice. Check out the attached screenshot for an example question/rubric (there are more on gradescope.com)
Media from tweet 1059997365350805504
RT @jamescham: Asking software developers to manage ML models is like asking the calvary to manage tanks. The basic moves are different, th…
Brilliant lectures by @jiayq and @l2k on the last day of the Full Stack Deep Learning bootcamp! It was an honor to host such a fantastic group of learners. @pabbeel, @josh_tobin_, the @gradescope crew and I are very thankful to everyone who attended!
Media from tweet 1026269496686731265
RT @kevgski: Woo shipping stuff is always fun! Heres a video I made to try and communicate what AI assisted grading means: https://t.co/eIA…
RT @AndrewYNg: With so much information now online, a strong work ethic and growth mindset, even more than knowledge, predicts your future…
RT @gradescope: Data we analyzed from 1500 exams shows how little the average captures student performance. #EndofAverage https://t.co/GiEJ…
RT @gradescope: Very excited to be named among "the 100 most promising private AI companies globally" on the AI 100 2017 list. Thanks @CBin…
@goodnts we are using openiv to train artificial intelligence at berkeley. поговорим?
#dev #matplotlib export transparent figure but with white bg inside axes: plot(randn(20),randn(20),'o'); savefig('t.png',facecolor="none")