Chana likes this.
Owain on AXRP!!!
Earlier this year, the paper "Emergent Misalignment" made the rounds on AI x-risk social media for seemingly showing LLMs generalizing from 'misaligned' training data of insecure code to acting comically evil in response to innocuous questions. In this episode, I chat with one of the authors of that paper, Owain Evans, about that research as well as other work he's done to understand the psychology of large language models.
Ben Weinstein-Raun likes this.
like this
like this
Ben Weinstein-Raun doesn't like this.
Update: There are several minor-ish annoyances with LibreWolf:
- (as with probably most non-big-boy browsers, I think), it doesn't seem to support Widevine, which means you can't use some streaming services, and others don't support HD video.
- Google Maps zooming, which is normally smooth in most browsers, is jerky and a little annoying in LibreWolf
- Some other webapps use maps libraries that also don't seem to work well (e.g. I can't see the DoorDash delivery map)
- You can't easily add Google as a search engine; it seems to have a special case where it will refuse to add a custom search engine named "Google"; you have to call it something else (!). This seems like a very weird / user-hostile choice, but you can still add the search engine as long as you call it something else (e.g. "G" or "Google Search")
I'm going to keep using it, because I find these issues less annoying than upstream Firefox.
New AXRP episode with Lee Sharkey!
What's the next step forward in interpretability? In this episode, I chat with Lee Sharkey about his proposal for detecting computational mechanisms within neural networks: Attribution-based Parameter Decomposition, or APD for short.
I think the thing I really like about LLM-assisted coding is that it makes context switching easier.
I can be in "words mode" or "code mode", and switching between these takes time and effort. (There are more categories, but they don't change the fundamental point.)
In my job, I have to spend a lot of time in words mode, due to things like hiring and managing. Historically, this has meant that I only really get engineering work done when I have 2+ hour chunks to focus on it. But now I can often get work done in much shorter chunks, while still in words mode.
I would not like to spend all my time in words mode – I enjoy digging into the details – but it's really nice to have the option.
like this
Ok my new beliefs about blister prevention, after three weekends of backpacking for eight hours a day, and watching a bunch of YouTube videos:
- blisters are caused by layers of skin delaminating, not "friction" / heat directly, though typically the delamination is due to static friction on the outer layer of skin, combined with wet skin. Dynamic friction is more likely to cause raw spots / wear straight through the skin.
- popping them as soon as you find them is basically always the right call unless you plan to be able to avoid the activity that caused them for a week; otherwise they just keep growing as you continue to do the activity
- blister donuts and moleskin work okay as long as you can keep them in place somehow, but they don't stick well on their own
- leukotape, very very widely recommended, is worse than useless because the adhesive seeps through the tape and makes your skin stick to your socks even more tightly than it was before.
- toe socks are pretty good
- KT tape is very good
- Vaseline/similar is pretty good as long as you can get it to stay in the right spots
like this
Anyone have luck getting LLMs to write tests without mocks? The tests I want are often just 1-2 lines of code, but anything I get from Claude or Gemini ends up being 20-30 lines long, despite requests for conciseness, saying no mocks are needed, and seeing using real resources is ok.
(I use LLMs a lot for other stuff, but tests seem to be particularly bad.)
like this
Satvik likes this.
Sonnet 4 is tremendously more effective for my use cases, probably because I use a niche programming language (Julia). Two weeks ago I would have said LLMs make me ~10% more productive, now it looks closer to +100%.
And I'm not even committing LLM-generated code – I just use it to iterate and test on designs, then delete the code and implement from scratch manually.
so, I'm going to japan in a few weeks, to do this pilgrimage backpacking trip with a friend.
I'm very out of shape compared to the difficulty of the route (alltrails.com/explore/map/map-… : 4 days; average of 10 miles and 3200ft elevation gain)
So my plan is to train as much as I can between now and then. I've figured out this practice loop, starting from my house, that I'm going to try to work up to doing on both the 17th and 18th: alltrails.com/explore/map/kuma…
It takes a pretty cool path over the hills and down to the reservoir.
Anyone want to join for any of this? As you might guess I expect to be very slow and take lots of breaks (today I did only about half of this loop; 6-ish miles; and it took me like 4 hours)
like this
Yo that's really cool. I wish I was in the bay to practice with you.
Consider wearing a backpack on the trek if you aren't.
Ben Weinstein-Raun likes this.
I tentatively think that rain jackets would work better if they were more like coats of feathers.
Usually rain jackets are either (a) totally waterproof, in which case you sweat and it condenses on the inside of the jacket, or (b) "breathable", in which case they fairly quickly "wet out" and the sweat actually still condenses on the inside.
Feathers work partly like a "breathable" rain jacket, in that they're porous and hydrophobic on the outermost layer, but they're also anisotropic: rain jacket material is the same in all directions, while feathers work kinda like roof shingles: The water rolls off, but there's space for air to pass underneath the feathers. This is fine because rain mostly comes from above, and anyway I bet you can make fairly complicated labyrinths of air passageways such that even splashing water is very unlikely to make it through the jacket.
like this
Daniel Ziegler likes this.
like this
Went on a photo walk today, mostly around Berkeley, and tested out some new camera settings. Most of the photos didn't turn out as well as I hoped, but I got a few that I like after a little postprocessing in Lightroom.
flickr.com/photos/spiritfox/54…
like this
Okay, what the heck is up with people doing deceptive things to prevent "panic"? What are the actual dangers of "panic?" I was just watching this new Veritasium video about an engineering firm discovering that their already-built Manhattan skyscraper has a 1 to 5% chance of collapsing per year, and deciding that they're not going to tell anyone about it while they spent months fixing it. The head engineer explicitly says in a recorded lecture that this was justified because people "don't need to be terrorized". Is that even plausible?
- YouTube
Enjoy the videos and music you love, upload original content, and share it all with friends, family, and the world on YouTube.www.youtube.com
like this
Ben Weinstein-Raun likes this.
like this
I tried out making an unboxing video:
I'm pleased with how it turned out, though the subject matter is objectively not very interesting.
JP Addison likes this.
like this
openAI finally added this capability to the API, which is why I am only now playing with it, because I am much happier paying 30¢ per image I actually generate than paying $20/mo regardless of how much I use
also if you want me to generate images for you I am very happy to do that
Ben Weinstein-Raun likes this.
like this
like this
like this
I've been getting really into pocket knives this week, and especially learning about knife steels. The biggest surprise has been that one of the several aspects of Atlas Shrugged that caused me to lose suspension of disbelief, has become much more believable in retrospect:
When I read it I felt like this whole part about the "guy invents a new metal that's just straightforwardly better than existing metals, and names it after himself" was just too farfetched.
But it turns out that actually this is just a thing that can happen. This guy Larrin Thomas basically straight-up did this with a knife steel alloy in 2021. The alloy is notably better than others for the purposes of pocket knives in almost every respect. Like, in any single dimension there are steels that do better, but this alloy is like the Hawaii of the knife steel Pareto frontier. He didn't name it after himself but I think he could have called it "Larrin Metal" if he had wanted to. He actually called it "Magnacut".
JP Addison likes this.
I guess my intuition is that alloys would be "smooth" in their properties. You add more chromium and certain properties increase. Maybe they stop increasing or start reversing after a while, but it's not hard to find the optional points for each property over time.
With that intuition it seems surprising that it's hard to find new alloys that haven't already been found.
Ben Weinstein-Raun likes this.
Have been wearing minimalist/"barefoot" sandals for the last couple days, and it feels somehow unhinged to say this, but I think they're making me feel noticeably happier?
Like, it reminds me of the thing where anosmia is linked to depression. It's like I regained a nontrivial part of my sense of "what's going on in the world".
like this
AXRP Jason Gross
How do we figure out whether interpretability is doing its job? One way is to see if it helps us prove things about models that we care about knowing. In this episode, I speak with Jason Gross about his agenda to benchmark interpretability in this way, and his exploration of the intersection of proofs and modern machine learning.
Ben Weinstein-Raun likes this.
like this
On the way home from the coffee shop I encountered a bright blue bird as I walked past a bush, about half a foot from me, which stared at me for a second, quietly squawked, and flew away.
Then soon afterward, in a different bush, I encountered an ooze (pictured, made of bubbly foam) trying to stealth.
I'm concerned that someone has changed the genre of my RPG, in a worrying direction. Will be avoiding taverns.
like this
Men's Dang Soft Boxer Briefs | Duluth Trading Company
Softness from above for down below? You bet! Duluth Trading’s new Dang Soft® Boxer Briefs’ fabric is so silky-smooth, all you’ll feel is sky-high comfort!www.duluthtrading.com
like this
kip likes this.
Ben Weinstein-Raun
in reply to Ben Weinstein-Raun • •