Skip to main content



One of the main questions I ask in interviews is basically "we have a data pipeline with goal X and constraints A, B, and C. How would you design it?" Depending on how they do, we'll discuss various tradeoffs, other possible goals/constraints, and so on.

This is based on a real system I designed and have been maintaining for ~5 years, and is also very similar to other systems I've run at previous jobs.

About half the candidates complain that it's not a realistic question.

in reply to Satvik

Is it possible that they mean some other kind of realism? Like, maybe they think it's unrealistic to design a system like this during the course of an interview, or something?
in reply to Ben Weinstein-Raun

I've asked for more specific feedback, and the complaints often come down to "nothing I've done has been like this" and "most of development is web development." That might be true, but we don't have a website/web app, and we're pretty specific about the work involved in both the job description and the phone interview.

(We have had other feedback that's helpful)

Generally, everyone who's done well on this question and joined has been a strong hire, though we've also hired some people who didn't do well that specific question. So I'm pretty sure it's a good question. I'm just a little amused/dismayed at how many people seem to think "realistic" means "web development."

This entry was edited (6 months ago)


Grammar-translation exercises convict you of your sin, but only comprehensible input can bring salvation.
in reply to Daniel Filan

Where of course "sin" means failing to have internalized the full glory of the grammar and vocabulary of the target language.


When I see that the US uses voting machines and takes ages to count votes, my instinct is to support a reform of "just do it like Australia/the UK/other civilized countries", where:
- all ballots are paper
- there's no such thing as provisional ballots
- postal votes have to arrive by end of voting election night
- on election night, ~all votes are counted by hand, observed by scrutineers employed by the candidates
One issue is that this is maybe somewhat of a simplified account of how places count votes (aec.gov.au/voting/counting/). But I also wonder if it's just one of those things the US could not implement if it tried, due to lack of the relevant kind of "state capacity".


I'm currently partway through job application processes at 80,000 Hours and the UK AI Safety Institute. Does anyone have any opinions about those employers?

As someone who is primarily a software engineer and manager / CTO, but not wedded to being those things, if anyone has any other things they think I should apply for or consider doing, let me know? Working independently, unpaid, is also an option.

in reply to Ben Millwood

I think research managing at MATS is kind of fun, applications are technically closed but it's possible they'll continue to consider good candidates. Link to apply here. In my role as a MATS employee I encourage you to apply but make no promises that your application will be considered.


HomeAssistant. Wow. How had I not heard of this before this year?

It's allegedly an open source home automation system, but I keep running into ways that it's actually way better than that. You can connect it to just about anything: Sure, air conditioning and lighting and smart locks and all the other home automation stuff. But also fitbit, mattress coolers, various internet data sources; you can use it to set up a custom ChatGPT-powered Google Home replacement (finally!).

And it's all so polished! Like, yes you need to be a level-1 technical person to set it up, and to use the more advanced features, but the flows are so reasonable and reliable. I'm genuinely kinda shocked.



Basket of Lemons, 1992 - Jose Escofet.
From: https://x.com/MenschOhneMusil/status/1856974702498996385


New short AXRP with Zhijing Jin!


New episode of AXRP with Zhijing Jin - this time, a short one (22 min), offering an overview of her work. Blurb below, links in comments.

Do language models understand the causal structure of the world, or do they merely note correlations? And what happens when you build a big AI society out of them? In this brief episode, recorded at the Bay Area Alignment Workshop, I chat with Zhijing Jin about her research on these questions.

YouTube

Transcript



A theory of how alignment research should work


in reply to Ben Weinstein-Raun

tbc, in this comment I did basically:
[ul]
[li] foo
[ul] [li] bar [/ul]
[li] baz  
[/ul]


RIP Inkscape


I recently installed Inkscape in order to make anSVGs for a puzzle hunt... and managed to crash it in less than 10 seconds just by somewhat rapidly clicking/dragging around to try to start drawing. Some Googling seems to confirm it's pretty unstable now; seems like it's not really usable anymore.

(I ended up just generating the SVG programmatically using Python shapely + matplotlib, which was better for my purposes anyway)



Seems like there's a good chance we get a 7-2 supreme court in a couple years. As someone who is pretty sympathetic to conservative jurisprudence, this still seems worryingly lopsided. Not sure there's much to be done in practice tho (except think about ways this should affect asset prices?).
in reply to Daniel Filan

Hmmm, Kalshi only gives this outcome a 23% chance. Can't tell if they're mistaken or I am.


Chapters of Familia Romana that are hard, according to me


For background, most chapters are only trying to do one or two 'things'.

Chapter 8:
- all the declensions of quis/quī, hic, is, and ille dropped on you in a single chapter
- in some sense only one 'thing', but that's around 144 forms you've got to remember (4 pronouns x 3 genders x 2 numbers x 6 cases).
- tbh I just went past this and hoped I'd get used to them rather than having to memorize them

Chapter 12:
- the fourth declension
- datives for possession (e.g. "Marcō ūna soror est")
- comparatives
- third declension adjectives
- datives of commanding and obeying

Anyway I'm up to chapter 13 which is less bad, hopefully the density of hard chapters does not increase.

in reply to Daniel Filan

I'm currently on chapter 27 and I have some aversion to re-reading it to do the exercises, because:
- it's just kind of boring (talk about how agriculture works, Julius demanding a tenant farmer pay some money he owes but relenting, Julius nearly punishing a shepherd for letting a sheep get away but then relenting)
- it's also quite long, longer than previous chapters
- it covers the present tense subjunctive which idk is not that hard I think

I'm currently procrastinating by reading the Cambridge Latin Course. But IDK maybe I should just skip the exercises and move on? Or do what I can without re-reading? The next chapter seems pretty cool...



Test of tusky


If I comment on this, will the comment also have the content warning? (which is apparently how these headings are implemented / what they're converted to)


Latin practice day 6


I. Num hominēs quī Berkeleiam incolunt barbarī sunt?
II. Mīlitēs armīs pugnant. Puerī pugnīs pugnant. Quō pugnat mercātor? Quibus pugnant pāstōrēs?
III. Num hominēs quī audiunt AXRP fortiōrēs quam illa quī audiunt The Inside View?
IV. Manūs sunt manūs bracchiī. Digitī sunt manūs manuum. Quae sunt manūs digitōrum?
V. Cūr medicus meus mē crassum esse dīcit?
VI. Potest homō avunculō avunculus esse?

#latinpractice



Susan Noble
‘Autumn Ferns’
From: https://x.com/marysia_cc/status/1855887688668524839


Panavise, Panavise
Every clamp holds so sweetly
Squared up right, fastened tight
You hold projects so neatly

Helpful jaw may you roll and yaw
roll and yaw forever
Panavise, Panavise
Bless my workbench forever



Ngl, I still get a little rush when I go to the store and see trading card packs, even for games I've never played.


Something I'm surprised I've never seen before: sacrificial anode toolbox liner. When I went to burning man last year, I brought my toolbox with me. It rained a good chunk of the week, and I needed to use my tools for various things. So of course, when I got back to civilization, some of my tools had rusted slightly. So today I took some copper fabric and wrapped up a couple magnesium rods, and lined the inside of the toolbox with them. Hopefully this will prevent much further rusting!


Apparently workout music also works for getting out of bed and beginning to go to the gym! At least it did for me just then.


Sometimes I'm confused by pro photographers




William H. Hays: Mountain Melody, 2022
linocut
From: https://x.com/marysia_cc/status/1854986382957330908/photo/1


I feel like commissioning episode art for AXRP is really causing me to run into the limitations of my imagination. Every episode I want to be like "hey can you draw a robot but it's evil" or "hey can you draw a scientist inspecting a robot".


From: https://x.com/0zmnds/status/1854995056660562070/photo/1


From: https://x.com/0zmnds/status/1854948751812837573/photo/1


From: https://x.com/opancaro/status/1854969896871870644/photo/1


Esa Riippa (Finnish, b. 1947)
Nocturno
From: https://x.com/marysia_cc/status/1854960702504812953/photo/1


People keep telling me I should use descript, but it doesn't run on linux :(


Paul Binnie Scottish, b. 1967
Moon at Shinobazu
From: https://x.com/marysia_cc/status/1854938647117640135/photo/1


Maybe this is paranoid, but should I try to buy a new laptop now rather than waiting until next year to save on potentially increased tariffs? I imagine those will probably take some time to pass?
in reply to Daniel Filan

Does anyone know the framework release cycle? Not sure if waiting trades off against much.


I think that, a little over four years after leaving the country, I finally have no bank accounts or any other ties in Hong Kong.

(They just abruptly cut off my ability to log in, but it was following a conversation about closing my account, so I assume that's what they've done. A more explicit confirmation would have been preferred, but oh well.)

I take this as a vindication of "no matter how late, still better than never" :)

in reply to Ben Millwood

I still have a phone number there actually, which is a little more difficult to let go of because I don't know who still has it and obviously if I cancel it I can't get that specific number back, but I guess once the dust has settled on this I'll be ready to let go


Misophonia has become a serious problem in my life. There's a person, who is totally great, and who is friends with all my friends and gets invited to everything I'm invited to, that I cannot be around for long periods because when they laugh my brain instantly jumps to an emotion not too far from blind rage.

I don't know what to do about this; it's very intense and basically preventing me from doing most social activities, even things in my own house.

in reply to Ben Weinstein-Raun

Sorry to hear it, this sounds very inconvenient and also socially very difficult to approach

My misophonia feelings have varied over the years FWIW. Certain mouth sounds bothered me a few years ago, then they didn't stand out again til a few weeks ago. But my thing has always been much milder than the thing you're describing.

Have you tried loop earplugs? I think they say they can help with misophonia, and my friend with more widespread misophonia said they help a bit. Idk if they'd help with someone's laugh though.



Does anyone have a phone client they recommend? I'm using Tusky which is fine but could be improved on (looks more twitter-like than FB-like)
in reply to Daniel Filan

I just use Chrome; seems to work well enough, though I can imagine why you might prefer a client app.


This last week of obsessively focusing on antenna design has been really fun, but it turns out that there's a whole world that's been continuing to exist, depositing things into my email inbox and slack channels.


If "tweeting" is the act of posting to Twitter, is the act of posting here "superstimulating"? 🤔
in reply to Ben Weinstein-Raun

it would be fun if we can configure this instance of Friendica to use those terms in notifications etc!
in reply to David Mears

This is technically possible since Friendica is open source, but I don't think the current version supports it :/


in reply to Satvik

This seems cool; my current favorite workflow involves storing a snapshot with an LLM-generated commit message every time I save a file, and when I'm ready to submit, doing a git reset and manually doing a few rounds of "git add -p" on a new branch.
in reply to Satvik

Ideally: Keep working in the "wip" branch, and then go back and modify the PR branch commits. In practice: haven't been doing enough team coding recently to have much actual experience with that aspect
in reply to Satvik



Latin practice day 5: Questions about ch 9 of LLPSI

I. Cūr lupī edunt ovēs, sed non edunt pāstōrēs?
II. Quantum est vestīgium lupī?
III. Est sōl in caelō, aut sunt mōntēs et collēs et campi etc in caelō?
IV. Homō est pāstor ovium. Quis est pāstor hominum? Quōrum pāstor est ovis?
V. Pāstor quaerit ovis quae errat. Ovis quaerit herba, quae ēstur. Quid quaerit herba?
VI. In campō sunt ūna ovis nigra et ūndēcentum ovēs albae. Pāstor habet unum canem nigrum. Cūr nōn habet ūndecentum canēs albōs?
VII. Campus dat silvae ovem. Quid dat silva campō?
VIII. Habet arbor umbram, aut habet umbra arbōrem?

#latinpractice



Tanaka Ryōhei
Persimmons . Mountains

From: https://x.com/marysia_cc/status/1853326310933770502/photo/1



I spent the whole weekend building this... Err... "Portable" directional 2m #hamradio antenna with @Jen Blight, and it was pretty great.

(It's portable in the sense that it only weighs about 2lbs, and can be taken apart to fit in a backpack. I hope to make substantially more portable ones in the future).

The elements are made from 0.5" PEX irrigation tubing (think hula hoop material), wrapped in copper-coated fabric and copper tape, with a layer of clear "repair" tape to prevent tarnishing, while preserving the mad wizard staff vibes.

Each tube is connected to itself using Anderson powerpole connectors, so they can be easily unplugged, and twisted into a smaller footprint for travel.

The boom is a fiberglass driveway marker, and the mast is a telescoping fiberglass pole.

I don't yet know how well the antenna performs, and as you might be able to see from the VNA, it needs a little tuning. But my radio didn't emit any smoke when I transmitted using it! And the signal made it clearly to its intended target (who was about 30ft away).

in reply to Ben Weinstein-Raun

Update: I definitely see a fair chunk of gain! Not enough to communicate with Jen all the way in SF from my back yard, but my field strength meter shows ~40 units when my standard whip antenna only gets ~10. I'm not sure how to interpret this, but if it's roughly proportional to power, I think this implies roughly 6dBd / 8dBi? Also as far as I can tell the front/back ratio is excellent; saw barely any movement in the meter when placed behind the antenna.

I'm pleased with this as a first attempt!



Shiro Shirahata- Moon over Fuji, 1972.

From: https://x.com/MenschOhneMusil/status/1853201333983023451/photo/1