@TracingWoodgrains comments on "Culture War Roundup for the week of December 18, 2023

Culture War Roundup for the week of December 18, 2023

This weekly roundup thread is intended for all culture war posts. 'Culture war' is vaguely defined, but it basically means controversial issues that fall along set tribal lines. Arguments over culture war issues generate a lot of heat and little light, and few deeply entrenched people ever change their minds. This thread is for voicing opinions and analyzing the state of the discussion while trying to optimize for light over heat.

Optimistically, we think that engaging with people you disagree with is worth your time, and so is being nice! Pessimistically, there are many dynamics that can lead discussions on Culture War topics to become unproductive. There's a human tendency to divide along tribal lines, praising your ingroup and vilifying your outgroup - and if you think you find it easy to criticize your ingroup, then it may be that your outgroup is not who you think it is. Extremists with opposing positions can feed off each other, highlighting each other's worst points to justify their own angry rhetoric, which becomes in turn a new example of bad behavior for the other side to highlight.

We would like to avoid these negative dynamics. Accordingly, we ask that you do not use this thread for waging the Culture War. Examples of waging the Culture War:

Shaming.
Attempting to 'build consensus' or enforce ideological conformity.
Making sweeping generalizations to vilify a group you dislike.
Recruiting for a cause.
Posting links that could be summarized as 'Boo outgroup!' Basically, if your content is 'Can you believe what Those People did this week?' then you should either refrain from posting, or do some very patient work to contextualize and/or steel-man the relevant viewpoint.

In general, you should argue to understand, not to win. This thread is not territory to be claimed by one group or another; indeed, the aim is to have many different viewpoints represented here. Thus, we also ask that you follow some guidelines:

Speak plainly. Avoid sarcasm and mockery. When disagreeing with someone, state your objections explicitly.
Be as precise and charitable as you can. Don't paraphrase unflatteringly.
Don't imply that someone said something they did not say, even if you think it follows from what they said.
Write like everyone is reading and you want them to be included in the discussion.

On an ad hoc basis, the mods will try to compile a list of the best posts/comments from the previous week, posted in Quality Contribution threads and archived at /r/TheThread. You may nominate a comment for this list by clicking on 'report' at the bottom of the post and typing 'Actually a quality contribution' as the report reason.

Jump in the discussion.

No email address required.

TracingWoodgrains 4mo ago

New from me - Effective Aspersions: How the Nonlinear Investigation Went Wrong, a deep dive into the sequence of events I summarized here last week. It's much longer than my typical article and difficult to properly condense. Normally I would summarize things, but since I summarized events last time, I'll simply excerpt the beginning:

Picture a scene: the New York Times is releasing an article on Effective Altruism (EA) with an express goal to dig up every piece of negative information they can find. They contact Émile Torres, David Gerard, and Timnit Gebru, collect evidence about Sam Bankman-Fried, the OpenAI board blowup, and Pasek's Doom, start calling Astral Codex Ten (ACX) readers to ask them about rumors they'd heard about affinity between Effective Altruists, neoreactionaries, and something called TESCREAL. They spend hundreds of hours over six months on interviews and evidence collection, paying Émile and Timnit for their time and effort. The phrase "HBD" is muttered, but it's nobody's birthday.

A few days before publication, they present key claims to the Centre for Effective Altruism (CEA), who furiously tell them that many of the claims are provably false and ask for a brief delay to demonstrate the falsehood of those claims, though their principles compel them to avoid threatening any form of legal action. The Times unconditionally refuses, claiming it must meet a hard deadline. The day before publication, Scott Alexander gets his hands on a copy of the article and informs the Times that it's full of provable falsehoods. They correct one of his claims, but tell him it's too late to fix another.

The final article comes out. It states openly that it's not aiming to be a balanced view, but to provide a deep dive into the worst of EA so people can judge for themselves. It contains lurid and alarming claims about Effective Altruists, paired with a section of responses based on its conversation with EA that it says provides a view of the EA perspective that CEA agreed was a good summary. In the end, it warns people that EA is a destructive movement likely to chew up and spit out young people hoping to do good.

In the comments, the overwhelming majority of readers thank it for providing such thorough journalism. Readers broadly agree that waiting to review CEA's further claims was clearly unnecessary. David Gerard pops in to provide more harrowing stories. Scott gets a polite but skeptical hearing out as he shares his story of what happened, and one enterprising EA shares hard evidence of one error in the article to a mixed and mostly hostile audience. A few weeks later, the article writer pens a triumphant follow-up about how well the whole process went and offers to do similar work for a high price in the future.

This is not an essay about the New York Times.

The rationalist and EA communities tend to feel a certain way about the New York Times. Adamantly a certain way. Emphatically a certain way, even. I can't say my sentiment is terribly different—in fact, even when I have positive things to say about the New York Times, Scott has a way of saying them more elegantly, as in The Media Very Rarely Lies.

That essay segues neatly into my next statement, one I never imagined I would make:

You are very very lucky the New York Times does not cover you the way you cover you.

[...]

I follow drama and blow-ups in a lot of different subcultures. It's my job. The response I saw from the EA and LessWrong communities to [the] article was thoroughly ordinary as far as subculture pile-ons go, even commendable in ways. Here's the trouble: the ways it was ordinary are the ways it aspires to be extraordinary, and as the community walked headlong into every pitfall of rumormongering and dogpiles, it did so while explaining at every step how reasonable, charitable, and prudent it was in doing so.

Context

BoneDrained TracingWoodgrains 4mo ago · Edited 4mo ago

Roko was banned for revealing Alice and Chloe's real names. It's not hard to figure out their names, but I'll refrain from revealing them, to prevent the search engines from linking them to this.

I want to highlight this comment, contrasting the nonlinear environment with normal professional employment. Erica had the insight that Alice and Chloe might be "exploited immigrants," and indeed they are from Germany and Denmark.

Chloe is still active in EA, with a similar job title, but hopefully her current job is lower stress and more aligned with her interests. Her boyfriend from Puerto Rico has also continued in the EA space and has several posts on EA forums.

Alice has been deleting some of her online activity, and possibly changing her name. She frequents vegan restaurants and continues to be poly (amazingly, with prediction markets).

curious_straight_ca BoneDrained 4mo ago

When the real names are that easy to find, the ethics of enforcing a prohibition on 'doxxing' get a bit weird. What, exactly, are you protecting?

Probably, most people are just lazy and won't look anyway, so it still has a significant effect on the number of peripheral people who know. But I think people feel like they're really protecting alice/chloe's names more than they are.

It's also somehow funny that he only got a 1 week ban from the forum. It feels very short.

(note: I only quickly crosschecked with your descriptions, not with the nonlinear post content)

BoneDrained curious_straight_ca 4mo ago

When Scott wrote that the NYT article would make his job more difficult, I was sympathetic but curious. It's easy to find his previous handle yvain and his old blog and then his personal site where he has his name. Despite his poor op-sec of not making a hard break between identities, patients couldn't easily google his name to find his somewhat controversial postings. All was well until adding Scott's full name to Metz's article did more harm than good.

The current case isn't quite so clear. Starting with Alice and Chloe's real names, you quickly (without archive.org) find references to Nonlinear. But you have to put a few things together to connect with the current controversy.

Even without doxxing, it may be awkward when Alice/Chloe apply for their next job in the EA sphere. Upon seeing their résumés, the interviewer might ask, "was your experience at infamous Nonlinear as bad as Alice's?"

Of course, the reputations of Ben / Kat / Emerson are much more directly impacted. I think the common theme is that they didn't know (or care) about the normal standards for investigative journalism / employment. Not that I would've done much better, but spirited rejection of Chesterton's fences to escape local optima probably makes things worse.

slothlikesamwise Subway Wizard 🔮 curious_straight_ca 4mo ago

What, exactly, are you protecting?

Norms, generally. Deanonomizing people is something I'd rather not become allowable, and the incompetence of others shouldn't affect me.

Folamh3 TracingWoodgrains 4mo ago · Edited 4mo ago

I'm getting incredibly sick of the "rationalist" affectation/verbal tic of "statistically" "quantifying" your predictions in contexts where this is completely meaningless.

"But I think it would still have over a 40% chance of irreparably harming your relationship with Drew"

"Nonlinear's threatening to sue Lightcone for Ben's post is completely unacceptable, decreases my sympathy for them by about 98%"

What it does mean to have a 40% chance of irreparably harming her relationship with Drew? Does that mean that there's a 60%, 70% etc. chance of it harming her relationship with Drew, but in a way that could be fixed, given enough time and effort? What information could she be presented with that would cause her to update her 40% prediction up or down?

The numbers are made up and the expressions of confidence don't matter. It's just cargo cult bullshit, applying a thin veneer of "logic" and "precision" to a completely intuitive gut feeling of the kind everyone has all the time.

curious_straight_ca Folamh3 4mo ago

Here's the rationalist theory:

Let's say you do this ten thousand times over the course of a few years. Make a list of every prediction, and count how many predictions in the '40%' category were true. If it ends up as '40%', you're making good predictions. Or, take all your predictions and score them according to a brier score or another scoring rule, and if you have a low score you're making good predictions.

The theory is that you can take statements and make predictions, and often the best you can do is '60%' or '20%' while maintaining a good score, and that this says something about the structure of decisionmaking.

I don't like it personally. I think the complexities you need to explore are mostly unrelated to the exact numbers. But you probably can, after the fact, in most scenarios say 'yeah, her relationship was harmed' or 'no, it wasn't', and then score your prediction, and if your calibration is reasonable and you're not manipulating them then it might mean something!

MaiqTheTrue Zensunni Wanderer Folamh3 4mo ago

I disagree. Even if the numbers are somewhat made up, having a ballpark figure that tells you the relative probability of certain events that would result from a decision you’re planning to make.

Going to the Drew example, if I think that doing something (say going to school in another city and trying to have a LDR is going to result in a 40% chance that I’ll lose the relationship entirely, and a 60% chance that I’ll damage it in away that would be difficult but not impossible to fix, then I can use that to decide if that would be more important to me than the job opportunities, the scholarships, or whatever else I gain from going to school away from him. Might doesn’t give you enough information for a true reality check imo, because it treats low probability events equally to large probability events. Even using verbal categories like low, medium and high probability, especially when making a group decision aren’t precise enough to communicate what I’m actually thinking. Low is how low? For you it might be 5%, for me it’s 20%. We can’t communicate that well if we don’t know what the terms are.

twodigits MaiqTheTrue 4mo ago

I see an opening parenthesis without a closing one. Is your comment unfinished?

Gillitrut Reading from the golden book under bright red stars MaiqTheTrue 4mo ago

Even using verbal categories like low, medium and high probability, especially when making a group decision aren’t precise enough to communicate what I’m actually thinking. Low is how low? For you it might be 5%, for me it’s 20%. We can’t communicate that well if we don’t know what the terms are.

I think part of the point is the numerical values convey an unwarranted degree of precision based on the process that generated them. Say your estimate is 20% probability for X. Why not 21%? 19%? 25%? 15%? What's the size of the error term on your estimate? Is your forecasting of this outcome so good as to warrant a <1% margin? Of course, estimation of that error term itself has the same problems as generating the initial estimate.

curious_straight_ca Gillitrut 4mo ago

I don't think this is a good objection. Numbers are often approximate. 20% means 'somewhere between 10% and 30%' as much as 'around a hundred pounds' might mean '75-125 pounds'. On the other hand, I usually think it's better to actually say what ideas and conditionals inform your judgement rather than just saying a number, and I'm not sure what the number adds to the former.

KnotGodel curious_straight_ca 4mo ago · Edited 4mo ago

There is some deep epistemological stuff operating here.

If the complaint is that numbers are too "precise", and the solution is to add ranges (whether implicit or explicit), the obvious next question is what these ranges mean.

In formal Bayesian epistemology, there are no "error bars" around probability estimates. The probability estimate is simply the probability you believe X will occur, which can be computed using Bayes formula and a prior probability in simple cases (more complicated cases yield more complicated formulas).

In less Bayesian epistemology, you can recapture much of this using something like proper scoring rules, but, again, a scoring rule only asks for a number - not a range, so it's still unclear what it even means to say "somewhere between 10% and 30%".

In human language, you might want to say something like "I'm saying 20%, but that's because I've consulted the important evidence, but I know there is smaller evidence I'm not considering that might tweak that ±10pp." In this case, you are using error bars to indicate logical uncertainty.

Alternatively, there is a decent analogue in finance: market makers often have both buy and sell limit orders. The spread between them is an indicator of confidence. Note: in finance, the spread is useful only because finance is a fundamentally social endeavor. If you choose to offer a very small spread, what you're really doing is saying that you don't think anyone else can do better than you. [note: this is less true for non-market-makers, who, for binary instruments mostly just care about point estimates].

The financial framing has an obvious betting analogy here: when you say "between 10% and 30%" under this interpretation, you're actually saying is that you would accept a bet that X is true if the odds given were higher than 9:1 and that X is false if the odds were better than 7:3. If we wanted to formalize error bars on probability, this is the model I would advocate for.

[To be a tad pedantic, finance also cares about risk. If I'm market making, I might actually have multiple buy and sell limit orders with different volumes. Likewise, me saying "between 10% and 30%" might mean I'm willing to bet 1¢ at those odds, but it doesn't necessarily imply I'm willing to bet half my 401k.]

@Gillitrut

MaiqTheTrue Zensunni Wanderer curious_straight_ca 4mo ago

The numbers at least for me give me a ballpark estimate of what I think will actually happen given a certain set of conditions. If I say 25% (which in my mind is generally within 10% of the number I give) that communicates in a way that “low probability” doesn’t because “low” doesn’t mean anything. My low might be 25%, your low might be 5%. And making decisions, in a group setting especially, requires precision so that when weighing options you can know with some degree of certainty what people think are likely and unlikely and to what degree. This allows you to discuss whether an X% (+/-10%) risk of something happening being serious enough to make that decision a bad idea. If low can mean anything between 5% and 35%, it’s going to cause people to either overestimate the risk and be too cautious, or underestimate it and take risks that they might not take otherwise.

Gillitrut Reading from the golden book under bright red stars curious_straight_ca 4mo ago

My point is that it's important to make that uncertainty explicit because not everyone talking to you is going to understand that. Maybe you think 20% is shorthand for 10-30% but someone else thinks it's precisely 20% or is actually 15-25% or some other range. I think the "around a hundred pounds" is a good example because "around" conveys a degree of uncertainty on the "hundred pounds." If I was quoted a price of some good at "a hundred pounds" (no "around") and later found out it was actually 125 I would feel like I was deceived.

Probability already inherently indicates uncertainty though! You can just say you're combining the different 'levels' of uncertainty (what that means is debatable), and the average of [10..30] is 20.

But the average of [5..35] and [0..40] are also 20. Do you think all three of these ranges are conveying the same information because their average is 20? I don't.

quiet_NaN TracingWoodgrains 4mo ago

Interesting article.

In countries where prior restraint is common, keeping the target of an investigation in the dark might be the only way to publish at all. Of course, in the US prior restraint is virtually non-existent, so that this certainly does not apply to this case.

Likewise, there might be tactical considerations not confront the target of an investigation early on. "Dear Macbeth, just as an heads up, we are currently investigating an alleged involvement of you into the demise of the previous king" is certainly not required. But if you have spend months gathering the facts, then you should give the other party some time to respond.

In general, there is a cooperative mindset and an adversarial mindset. From my perspective, both can be wrong in some cases. If your target has clearly defected from humanity so completely that no further cooperation with them is ever possible, trying to destroy them with your investigation may be imperative. So if you discover proof that the Nazis are running death camps, you will probably not want to give them two weeks of time to do damage control, preempt your story by releasing damning information with their own framing and generally put their spin on it.

In the real world, few people and organisations are so beyond redemption that destroying them by turning arguments into soldiers is worth the price on your soul and the damage to the epistemic commons. The Sequences are very clear that humans are not naturally good on doing Bayesian updates from partisan information.

Civil courts work (mostly) with two adversarial sides presenting self-interested arguments because there are certain standards of evidence and the judges know the takes to be partisan and spent some effort to find the truth between the stories of both sides.

The court of public opinion may occasionally stumble on the truth if a matter appears very one-sided and also is very one-sided, but in most cases, almost nobody will find the shy flower of the truth on the Verdun-esque battlefield left by unrestricted argument warfare.

Fruck Lacks all conviction TracingWoodgrains 4mo ago

Excellent piece, great work all round with an important point I hope gets through to the ea community. To be fair to that Oliver guy re the hypothetical I can't blame him for thinking that "ignore complaints from the target and publish on time, they're probably lying anyway" is the standard for investigative journalism, because regardless of what professional journalists are taught (and taught to say), that is often how the majority of them behave. Which is not to say Herzog or Singal behave that way, or even Lewis (I don't recall ever reading investigative journalism by her, but I know she is a very careful and serious journalist), just that if Oliver hadn't been an amateur looking on from outside the profession he would have known there is only one answer any journalist who wants to maintain their reputation can give.

To be charitable to Oliver, that could also be why he thinks deadlines matter at all outside scheduled publications, although from other points you made in the piece I think it's clear he just wanted to publish it as soon as possible (which should have been a red flag for anyone close to him prior to publication that his motives were skewed.)

pro_sprond TracingWoodgrains 4mo ago

Great post. This whole controversy is pretty fascinating but also seems like something you could sink dozens of hours into learning about without coming to any clear conclusions about what actually happened, who's telling the truth, etc. Nevertheless, here are a few things that come to mind after reading a bit about it.

The original investigation by Ben Pace seems clearly negligent. Perhaps you could justify giving Nonlinear very little time to respond, but many of the claims in the original post are presented with evidence that seems to amount to little more than "Alice and Chloe told me this and they seem trustworthy to me (even though lots of people told me Alice is not trustworthy." The post also claims that "I personally found Alice very willing and ready to share primary sources with me upon request (texts, bank info, etc)" but often does not reference the primary sources supporting various factual claims that it makes.
The original post also features almost no quotes or perspectives from other employees of Nonlinear. But if you are claiming that Nonlinear has an abusive work culture, such perspectives seem clearly relevant. There is one section labelled "Perspectives From Others Who Have Worked or Otherwise Been Close With Nonlinear" but this section is very vague and often makes claims that are not backed up with quotes. The quotes it does include are lacking context and it's often unclear who they are attributed to. For example, one quote is preceded by "Another person said about Emerson:" But we are not told who this person is or what their relationship to Emerson is.
These people seem to have a pathological obsession with anonymity. I understand the argument for keeping some people's identities secret, but often so much information about people being quoted is removed that it is hard to tell how to evaluate it. One example is described in the previous bullet point. For another example, see the list of 28 times 'Alice' accused people of being abusive from the Nonlinear response post. It includes things that are almost impossible to evaluate like:

Alice accused [Person] of [abusing/persecuting/oppressing her]

Alice accused [Person] of [abusing/persecuting/oppressing her]

Alice accused [Person] of [abusing/persecuting/oppressing her]

Alice accused [Person] of [abusing/persecuting/oppressing her]

More generally, it strikes me that both reports are very badly written. Compare them to basically any investigative report by a high quality news organization like the New York Times. No matter what you think about the NYT's bias, accuracy, etc, their articles are typically clear and easy to read. They clearly lay out the context for the story and the overall narrative and they manage to do so while supporting most of their claims with specific quotes from either named individuals or people whose role in the story is clearly explained and they typically include quotes from outside experts to contextualize things. Importantly, they also do so relatively concisely. Ben Pace's original report is about 10,000 words! And yet, it does a worse job providing context, evidence for its main claims, and a clear narrative than many 2000 word NYT articles.
Part of the reason both reports are so badly written is that they spend so long on haranguing the readers about how they should feel about the evidence provided. The original report begins with a paragraph-long "epistemic status" and spends a huge amount of verbiage analyzing the author's (i.e. Ben Pace's) own opinions about how much to believe what he wrote. But these feelings seem to mostly boil down to "I think that Alice and Chloe are fairly trustworthy and feel that there is evidence supporting their accusations." But instead of spending so many words saying this, why not just present the evidence as clearly as you can? I understand that some of the evidence may be inconclusive, but then why not present it as such and let readers draw their own conclusions? To an outsider, the post has an atmosphere of "I, Ben Pace, am a responsible and trustworthy person and so you should trust that I have studied this issue carefully even though I won't present most of my evidence."
Ignoring the truth or falsity of the various accusations, the whole setup sounds pretty crazy. Even if Nonlinear is not at all abusive, it seems like a terrible idea to accept a job where you'll be viewed as "part of the family" or "part of the gang." And why were they jetting around the world, staying in exotic locations in the Bahamas, etc anyway? Is that necessary or helpful in doing work on AI safety? I realize that Nonlinear was supposed to be at least partly an incubator, but to me it seems to have been much looser and blended work and personal life much more than most other incubators. Perhaps that's what some people want, but it seems to come with big risks (which this blowup demonstrates).

What is this place?

Why are you called The Motte?

New post guidelines

Rules

Recommended Posts And Communities

Recommended Realtime Chats