canyonwalker | Reading

xkcd_feed

https://xkcd.com/3105/

We're getting a lot of complaints from commuters who were routed onto a coaster, but the theme park patrons who spent hours stuck on an intercity line are also not happy.

https://xkcd.com/3105/

gnomestew_feed

Posted by J.T. Evans

https://gnomestew.com/collaborative-storytelling-loop/

https://gnomestew.com/?p=53011

Most RPGs tell a story of some kind. Some are deeper than others, but there is a story hiding under the cover of the mechanics somewhere. Since most RPGs are played in a group, this makes the storytelling collaborative. At least, it should be collaborative. If it’s not, then the person telling the story to the group may as well be sitting at a campfire and telling the others a ghost story. While that’s perfectly fine for entertainment purposes, that’s not the point of playing a role playing game.

To help everyone make their tales more collaborative, I’ve come up with the concept of a storytelling loop that I’ll explain below. While this loop might sound like hard and fast instructions, please don’t treat them as such. If the loop needs to be adjusted based on circumstances, do so. This is not an instruction manual for assembling a set of shelves or a desk. This is guidance via a framework to help you think more clearly about how things could flow.

Group Effort

The collaborate you need a group.

Obviously, to collaborate you need a group. This means everyone in the group should be involved in the telling of the tales. Some people will shine at times while others fade to supporting roles. This is perfectly natural, but the GM (and other players) should be aware of when an individual shines too much or too brightly and refocus the attention on another person. This is called “spotlight management” and there are several other articles here on Gnome Stew that delve more deeply into this topic. Just search for that phrase, and you’ll find a wealth of information.

No One Side of the Screen is Responsible

Use both sides of the screen.

As is implied by the above “group effort” section, the GM is not solely responsible for the storytelling loop. It’s a back-and-forth with handoffs and flowing between GM-provided narrative and information and the PCs responding to that new information.

Flow State

Even though what I’m going to present below implies a smooth handoff between GM and PCs, this is not always the case. The GM may have a PC do something that interrupts the loop or the PCs may latch onto something the GM has presented that breaks the flow. That’s perfectly fine. From my experience, however, what I’m outlining below is the typical flow for the storytelling elements that come into play.

Yes, And

To keep the flow going, don’t just agree with another player’s idea. Tack onto it. Add to it. Build upon what they’ve offered up and keep the story going.

No, But

If someone puts something out there and you (either as GM or player) don’t fully think it’ll fit into the current story that’s being told, don’t just hard stop the idea. Take the concept and redirect it with a “but” clause that gets it back into the flow state of the story.

The Storytelling Loop Framework

GM Introduces Scenario
Optional: PCs React to Scenario
GM Triggers Events
PCs React to Events
PCs Make Choices
PCs Take Actions
GM Reacts to PCs’ Actions
GM Triggers New Story Events
Repeat Steps 4 through 8

1) GM Introduces the Scenario

Time, place, NPCs, and scenario are introduced.

The start of any decent story introduces a time, place, maybe some NPCs, and a scenario that is unfolding (or has just occurred). This is the initial setup of the adventure where the PCs band together (if they’re not already together) to face a threat, goal, situation, enemy, or to set out toward a destination.

2) Optional: PCs React to Scenario

What are you going to do about it?

Once the scenario is established, the PCs might want to do some prep work. This could be divination, research, travel, acquiring provisions and equipment, talking to NPCs, formulating a plan, or coming up with ideas on what might come next. This doesn’t always happen because the “GM Triggers Events” can sometimes happen immediately after the scenario is established. This is common in a “in media res” style of storytelling where the action hits close on the heels of the story’s introduction.

3) GM Triggers Events

Trigger action in your PCs.

The GM then sets up and triggers some events that impact the PCs. These events could be an attack, the start of a heist, a critical discovery, acquisition of a map, gaining new information, or witnessing something happening that the PCs want to aid or prevent. The key here is not what kind of event is triggered, but to ensure at least a few of the PCs have a motivation to interact with the event being presented. If a man is resisting arrest in front of the PCs, there might not be any motivation attached to stop the man or the authorities. However, if the man is a close friend of several of the PCs, they might want to at least interrogate the reason for the arrest, if not stop it. On the flip side, if the man is a dire enemy and appears to be close to escaping his arrest, the PCs will want to step in and ensure he doesn’t get away.

4) PCs React to Events

React and choose!

The event presented by the GM should cause some flavor of reaction in the PCs. It is preferable that this be an emotional reaction combined with a physical one. A mere physical response out of necessity is not as visceral or deep as actions taken because of an emotional attachment.

5) PCs Make Choices

The PCs must decide (sometimes quickly; sometimes methodically) on what their next actions will be in response to the presented event. The timeframe of how quickly the PCs must react will depend on the type of story being told and the urgency of the event put in front of them.

6) PCs Take Actions

Don’t over analyze. Act now!

Once the PCs have reacted and made some choices, they have to put their plans into action. This could be as straightforward as drawing steel against the orc raiders to save the kidnapped prince. If the prince was just discovered kidnapped and the trail has grown cold, then the PCs’ actions might involve an investigation of the crime scene or tracking the kidnappers or talking with witnesses to gather more information in order to find where the kidnappers took them missing prince.

The number of actions available to the PCs are countless, but should be guided and focused by the event in front of them. I call this a “creative compass.” What I mean by this, is that the GM should provide some hints or choice options to limit the amount of “analysis paralysis” the PCs will go through. If the PCs merely know “the prince is missing” then they don’t really have enough information to formulate an intelligent response. If the event is “the prince is missing and several people saw a squad of orcs carrying a bound child through the darkness at the edge of town to the north,” then the PCs can do a minor amount of investigation before delving into the wilderness to the north.

7) GM Reacts to the PCs’ Actions

The only constant is change.

Depending on what the PCs actions are, there are numerous effects they’ll have on the story. I’ve grouped these changes into the following buckets:

Story arc change of direction
Setting changes
NPC changes/additions/losses
Enemy changes/additions/losses
Ally changes/additions/losses
Consequences and Adverse Effects for the PCs
Potential Rewards for the PCs
- Treasure
- Experience
- Allies
- Favors
- Social Status

These changes to the story, I hope, are self explanatory. If someone out there wants me to delve deeper into these, I can do so in another article along these lines. Just drop a comment below and let me know.

8) GM Triggers New Story Events

Inspire more action with a fresh event.

As with step 3 above, the GM will introduce a new event based on what has changed in the story. These changes will be based on the outcome of the PCs’ actions. Instead of moving forward with a pre-written response to the action outcomes, the GM should adjust things as necessary to make sure the new story event aligns with what just happened. Sometimes, your pre-written event will work perfectly. Most of the time, GMs will have to make minor adjustments. There are rare cases, however, where the pre-written prep the GM has done will need to be massively reworked or thrown out.

9) Repeat the Process

Rinse and repeat as needed.

Clearly, if the story has not come to a satisfactory conclusion (returning the kidnapped prince from the clutches of the orc raiders and back to the royal castle), the story must continue. Repeating steps four through eight will keep the flow going on the story. Once the story arc, character arc, or campaign arc have come to a conclusion, the GM should wrap things up with the PCs and not introduce a new event. If the GM does introduce a new event, this is an indication of a fresh story/campaign arc that is about to be told.

Addendum: Automated Triggers on a Timer

Tick tock goes the clock.

There are some external factors that can modify GM events or PC actions. Many of these run on a timer, but some may be pseudo-random. Examples of automated triggers are:

Weather Changes
NPC actions
Enemy plans and actions
Movement of celestial bodies
Random encounters
Random treasure

Conclusion

I hope this description of a collaborative storytelling loop helps you identify where you’re getting things right and maybe areas for improvement. If nothing else, I hope this article gave you something to mentally chew on and apply in your role playing games.

https://gnomestew.com/collaborative-storytelling-loop/

https://gnomestew.com/?p=53011

canyonwalker

Today was the start of a four-day weekend. We'd had plans to travel but canceled those plans a few days ago due to lingering sickness. Instead I stayed home and basically wasted the day. 😞

I had trouble sleeping last night. I got down to sleep okay, going to bed a bit early (before 10pm) as I'd been up since 5:30am. But then I awoke at 3am and could not get back to sleep. I tossed and turned in bed for a bit, willing myself to get back to sleep, but gave up after 30 minutes or so when I could tell that wasn't going to work. I got up and fooled around on my computer for 3 hours. I felt tired after the sun had risen and went back to bed at 6:30. Then I slept until almost 11. I'd fret about wasting half the day in bed but it's not like I had much else to do today.

Hawk and I got out late for lunch today. It was her first dining out in several days, since she was diagnosed with bronchitis. She still has a cough— we both do— but it's typically for a cough to linger without being contagious for a few weeks after this kind of sickness. I'm very familiar with the lingers-for-a-few-weeks cough, unfortunately. Anyway, since it was her first time out of the house in a few days we went to one of her local favorites, Speedy's Tacos. We got donuts after at Daily Donuts.

In the afternoon I continued my tempo of not doing much at home. Late in the day I got really bored and finally pulled out my sewing kit to fix seams and buttons on some pants and shirts that have been waiting for attention for, in some cases, 6 months. I had been debating the over-under of sewing them myself versus taking them to a tailor shop to be fixed for, I dunno, $10 or whatever each. Probably it would've made economic sense simply to pay for someone else to do the work, but I chose to do it myself because there's a certain Zen aspect to doing it. Even with the frustration of how it just gets harder to thread a needle as I get older.

canyonwalker

For several weeks we'd been planning to travel this weekend. Tomorrow is Juneteenth, which is a company holiday, and I used PTO for Friday to have a four day weekend. It'd be our first trip of the summer. Unfortunately we decided Monday to cancel it. Why? Because we're still sick.

Hawk visited the doctor on Monday and was diagnosed with bronchitis and an ear infection. I'm still getting over my cold... which may actually be/have been bronchitis, too. We hope to be better by tomorrow or Friday, but just in case we're not we made the tough decision to cancel our travel plans.

I hate missing the opportunity for a vacation, even a short one. Being sick really sucks.

sorcyress

Today was the day off to fuck around! Like, the single day I have to recover from burnout of the past school year and be ready for the upcoming several weeks of _stuff_. I'm sure that's fine, I'm sure that's how brains work.

Anyways, I spent the first ninety minutes of the day or so wandering around the various Alewife parks/reserves. There's a lot of good nature over there! I saw multiple turtles and a well posed squirrel and a pretty moth and some ducks! Also lots of green, which smelled overpoweringly of flowers and nature and that's weirdly good despite being a sensory nightmare.

Then I went home and played a _lot_ of days of Stardew Valley, ( the next two paragraphs are boring unless you're also into Stardew! )

In the real world, I wrapped up the evening by hanging with Ruthie and The Toddler for a bit. It was a good evening! A dinosaur drove a truck, which is entirely a correct thing for toddler toys to be doing. And the bedtime story was Magic School Bus and the Hurricane, which was great fun.

Tomorrow has many plans. Here are some of them:

*Call Tuesday, work out final plans/timing for the next few days

*Possibly create A Snack for the pride party

*Go to the grocery store

*Write an entire pre-Pinewoods todo list

*Clean the bathrooms

*Photograph free stuff so I can put it on Facebook and maybe even get it out of the house before I go

*Laundry is a Friday plan. So is packing for mine and Tuesday's adventures and maybe if I'm really good packing one (1) bag for Pinewoods.

Huzzah or whatever!

(And maybe after my dance party I can play a few more days of Stardew).

~Sor

MOOP!

xkcd_feed

https://xkcd.com/3104/

Numbers can be tricky. On the day of my 110th birthday, I'll be one day younger than John Tukey was on his.

https://xkcd.com/3104/

gnomestew_feed

Posted by Poddy Gnomington

https://gnomestew.com/gnomecast-215-getting-them-to-play-something-else/

https://gnomestew.com/?p=53004

Join Ang along with Jared and Tomas as they talk about how to get a group stuck on playing one particular game to try something else.

Links:

Wildmage Press

Deerstalker Pictures

The Forever DM

Hearth Magazine

https://gnomestew.com/gnomecast-215-getting-them-to-play-something-else/

https://gnomestew.com/?p=53004

canyonwalker

Episode 6.09 of Better Call Saul, "Fun and Games", is not the last episode in the series. There are 4 more after it. But it completes a story arc I've been wondering about since the start of the series: How does Kim's story end?

We've known since the start that Kim's story has got to end somewhere in this series. That's because of what I've repeatedly referred to as The Star Wars: Rogue One Rule. A major character in a prequel who doesn't appear in the original series— Kim is not mentioned at all by name in Breaking Bad— is doomed. The writers had to get rid of her to maintain continuity with the original series.

( Spoilers! )

sorcyress

I did it! I finished cleaning up my room! I was able to play School's Out For Summer at about 6:50pm or so, which is plenty good as these things go. *THUMP* goes the brain. I would like to do absolutely nothing! But things are coming up!

Here's some good things coming up in the near future:

*I'm going to go to Keuka (in the finger lakes in NY) with Tuesday's family! This appears to be the kind of vacation where you just chill out and read books and go for walks and things. I'm excited! I don't really know how to have that kind of vacation, so it'll be a good chance to try things out.

*I'm doing a road trip from Keuka to MD with Cameron (Tuesday's sibling) which sounds like it will be quite a nice time, honestly. I've road-tripped with Tuesday a fair amount, and ker mom a few times, so I'm excited to do it with another member of the family.

*Almost immediately after, I'm road-tripping back up from MD to Boston (and then ESCape) with mom and Wicked Auntie Robin. Mom and I are definitely compatible road trip buddies! I think Robin will be a good third!

(I recently got a kickstarter which is "gay games to play in the car with your girlfriend while she drives you to go camping" so I'm excited to have some road trips to play the gay games!)

*ESCape is going to be amazing, obviously. I'm the official scottish teacher! I get to MC in the evenings and teach two classes! AAAAH! Good aah, but also stressed aah.

*Once I'm back from ESCape I get to actually breathe for a week or so, which is going to be my play video games time. I am _extremely_ looking forward to it.

And then there's some other stuff in the back half of the summer. That's good too!

~Sor

MOOP!

solarbird

Today’s Tuesday. There’s a Tuesday #TeslaTakedown protest, Lynnwood, Tesla dealership on 99. 4:15pm TODAY, like usual. As I’m writing, that’s literally an hour and like 20 minutes away.

Yeah I kinda forgot too lol, but no reprieves for the fascist.

Let’s get out there~~~

Posted via Solarbird{y|z|yz}, Collected.

canyonwalker

Yesterday I wrote about trying shit beer just because it's well reviewed in a Buzzfeed listicle. Well, there was another beer on one of those shit lists that I was keeping in the back of my mind. Not because it's actually shit but because I've had it before and— at the time at least— didn't think it was shit. I mean, how could this beer be shit when it didn't just impress 10 random bartenders surveyed for a 2020s Buzzfeed listicle but inspired a blockbuster movie franchise in the 1970s and 1980s?

That's right, this beer inspired a movie.

I'm talking about Smokey and the Bandit.

And it wasn't just a touchstone of American cinema from 1977 but it also launched the career of movie star Burt Reynolds.

And the beer? I'm talking, of course, about Coors Banquet.

Coors Banquet, the original Coors beer (Jun 2025)

Coors Banquet, the original Coors beer (Jun 2025)

"Wait," you might be thinking, "Isn't Smokey and the Bandit about Burt Reynolds exuding ``country cool'' while outfoxing a bunch of incompetent Southern sheriffs who are trying to write him speeding tickets?"

Yes, it is about all that. But the driving motive of the story, the thing that kicks off all the action, is beer. And not just any beer, but specifically Coors Banquet.

It's part of American folklore that Coors, based in Golden, Colorado, didn't distribute its beer west of the Mississippi River until 1986. For decades people who traveled across the US found that this really tasty beer that was available in Western states wasn't available in the rest of the country. Among beer aficionados it became notorious. Travelers would bring cases home with them. Coors is even reputed to have been carried aboard Air Force One multiple times, as presidents Eisenhower and Ford were fans of it. Thus the MacGuffin for Smokey and the Bandit in 1977: a wealthy gambler in Atlanta challenges a truck driving team to bring him a shipment of 400 cases of Coors Banquet— illegally— in time to show off to his peers at a racing event. They have to drive from Georgia to Texas to fetch it, then back in record time to deliver it.

So, after that wind-up, how does it taste?

It's... not shit.

Unlike other "shit" beers I've tried— unlike pretty much all other traditional mass-produced US beers— it doesn't have a nasty taste. It doesn't have a nasty taste at the beginning that fades away, it doesn't taste nasty at the end. It doesn't taste nasty on its own, it doesn't tasty nasty with food. It just tastes... decent. Like mass-produced US beer if it... wasn't shit. 😳

Mind you, it's not a richly flavorful beer. It's basic beer. But it's decent basic beer. Like, I could set this alongside any of countless basic European beers and it'd fit right in.

Mind you #2, this is not Coors Light. The "Silver Bullet" is a hugely selling beer, propelled by the sales and marketing behind its lower calorie formulation. It's lower in calories and also way lighter in taste. Coors Banquet is the original Coors and has a richer taste.

So, pop open a gold label Coors and watch country-cool Burt Reynolds outfox a bunch of dim-witted Southern sheriffs trying to write him a speeding ticket.

xkcd_feed

https://xkcd.com/3103/

Sure, this exoplanet we discovered may seem hostile to life, but our calculations suggest it's actually in the accretion disc's habitable zone.

https://xkcd.com/3103/

solarbird

Farty McShitgibbon squirted out another racist authoritarian shart on Pravda Sotsialnaya late Sunday night, blowing off steam caused by his frustration at protests and over his shitty birthday parade. It’s a screed of lies about Democrats being sick in the head and not actually Americans, sending more waves of cops and military at us, and about cheering on mass deportations. Like y’do, if you’re a fascist fuckhead.

But let’s pay particular attention to this bit:

…I have directed my entire Administration to put every resource possible behind this effort, and reverse the tide of Mass Destruction Migration that has turned once Idyllic Towns into scenes of Third World Dystopia. Our Federal Government will continue to be focused on the REMIGRATION of Aliens to the places from where they came…

CAPS as in the original. Bold added.

Remigration pretends to be but is not actually a subtle word.

Remigration means ethnic purge. Ethnic cleansing, if you insist – but I hate that term, because there’s nothing cleansing about it. It’s an ethnic purge. It’s violent, it’s bloody, it’s repression, it’s expulsion at the point of a gun, based on your ethnicity.

Remigration means ancestry-based expulsions of people who aren’t white, for being not white, and not for any other reason. Just for being not white. Legal status means nothing; being a citizen means nothing. Not white? Get the fuck out, at gunpoint. That’s how it went during Operation W*tback; that’s how he wants it to go again, and now he’s just saying it.

This is not a question; this is not up for debate. If you feel like debating it with me, screw you you fascist-apologist fuck, and sit the fuck down. In this context – in any context outside academia – that’s what it means. Go read Wikipedia if you want the definition and history.

“Remigration” in this context absolutely and only means ethnic purge. All the leading haters in his administration – Miller, I’m looking in your direction – know damn well what it means. They used the word on purpose.

So I’m asking you not to discuss this shart of a statement without going directly to what REMIGRATION means. Do not let anyone lie and pretend it means something else, because that’s bullshit. Remigration is why they’re revoking every legal status they can; remigration is why they’re trying to end birthright citizenship; remigration means a violent ethnic purge.

And that’s what he’s told ICE – or more generally, “ICE, FBI, DEA, ATF, the Patriots at Pentagon and the State Department” – to make happen.

So. What do we do, in particular, right now?

First off, make sure people know what this word means.

Second – know any “good cops”? Like, relatives or something you might actually be able to reach? Show them the Wikipedia article about what “remigration” means, and only then show them Trump’s statement. Make sure they understand what Trump said, and make sure they know what they’re being asked to do. Make them understand, like… now.

Make sure they understand the criminal act they’re being asked to perform.

Because that is, absolutely, the ask, and “just following orders” … just won’t do.

Posted via Solarbird{y|z|yz}, Collected.

canyonwalker

It was a quiet weekend at home for us. No, not in the "Let's spend hours sitting by the pool on a warm day!" fun kind of way. Though I did get out for a soak in the hot tub Sunday morning. Our around-the-house weekend was a mostly in-the-house weekend as we are still dealing with being sick and didn't have much energy for getting out and about.

For me, I'm mostly over my cold. The worst days were Wednesday and Thursday last week, thanks in no small part to allergies combining with the cold. But a hacking cough has stuck with me since then. Oddly, OTC cough medicines (guaifenesin and dextromethorphan) don't help this cough but nasal decongestant pseudephedrine does. We're going to need to hunt for more of that since drugstores no longer stock it regularly thanks to the government cracking down on a real or perceived drug epidemic.

Hawk has the same bug I've had but is a few days behind where I am on the curve. That means while Wed-Thu were my worst days, hers were this weekend. Thus she didn't have energy or desire to go out anywhere, not even for a short hike like we'd hoped as of Friday.

oots_feed

http://www.giantitp.com/comics/oots1328.html

gnomestew_feed

Posted by Matthew J. Neagley

https://gnomestew.com/a-case-study-in-dice-stat-tests-part-1-general-approach/

https://gnomestew.com/?p=52899

Recently the Kickstarter for “Honest Dice | Precision Machined Metal Dice You Can Trust” ended, having successfully funded for over half a million dollars. The Kickstarter was sent to me in the final few hours, so while I initially had some strong opinions about the content and the dice, there wasn’t time to do the proper review, analysis, and commentary that was warranted.

Having had some time since to digest and do a fair amount of math, I approach this as a case study, similar to the ones my business and stat teachers were fond of handing out: What analysis was done? Was it the right analysis? Was it done correctly? The point of a case study is a roleplaying exercise. You are taking the role of a pretend consultant and are handed a situation in progress. What do you do?

There’s a lot to unpack here so this article is part 1 of a 3 part series.

I wanted to write this article for two reasons:

First, reviewing what was done and suggesting areas for improvement provides a resource for those doing this type of analysis in the future. The internet is full of resources explaining how to do a chi-square Goodness of Fit test. Less common are resources explaining how to do a test of homogeneity. Rarer still are resources that explain why you would do one test or the other, and how to prevent and express the results.
Second, this is a half million dollar Kickstarter for game accessories, which puts it in a small subset of Kickstarters. Kickstarter’s own stats break their funding categories into 100k to less than a million and million plus. At the time of this writing, there have only ever been 4,268 100k plus gaming Kickstarters, putting Honest Dice in a small and impactful group. It only makes sense to review its claims and analysis carefully.

Before I start the actual statistics case study, there are a few points I’d like to make that fall outside the statistics:

Disclaimer: I have not bought, received, or even ever seen in person or touched any of these dice. I have not collected my own rolls, and cannot truly vouch for them. However, I’m inclined to take the evidence that has been presented at face value. You’ll see later during the review of the statistics why I think their collected data is likely to be legitimate and because of this, I feel pretty comfortable saying that these seem to be high quality, attractive, highly durable dice. They’re also expensive, even more expensive than you might expect for what they are. As such, if you like the look of them, if you like their unique features, and if you’re comfortable dropping the kind of cash the creator is asking for them, I have no reasons to tell you to not buy them. While the Kickstarter is already over, late pledges are still open. Also, the creator has a web store where you can buy many of their existing dice.

There were several non-stat issues I had with claims that were made in the Kickstarter. Most were inconsequential enough to leave out here, but one in particular struck me as disingenuous enough that while I am writing this article I felt I had to address it. At the start of their statistical analysis they make this statement: “I decided not to name the brands of the other dice. While I’m in favor of complete transparency, I’m also not trying to throw shade at other dice companies.” That’s admirable. The part I feel is disingenuous is that they immediately directly quote marketing slogans from the competitor in question, which is effectively exactly the same thing as naming the company outright. One cannot have this both ways. A company can name their competitor during comparison or not, but attempting to frame not naming them as taking a moral position makes an immediate violation of that stance into a moral position as well.

Finally, I’d like to briefly touch on the mathematical balancing of the numbers on the die faces that is a major feature of these dice. I think this is a very interesting topic and a similarly interesting topological problem. The section of the Kickstarter where they discuss how they determined their optimal arrangement is fascinating. In addition, I feel that this is something that could potentially enhance the fairness of some dice and that the particular arrangement of faces they came up with and the process of solving for optimal arrangements is something they should patent immediately if that is indeed a thing that can be patented. However, I do feel I have to point out that if the manufacturing standards to which the dice are held successfully create a die with probabilities very close to the ideal distribution, then rolling the die will result in a random distribution of faces very close to the ideal distribution as well. If the distribution of rolled faces are very close to the ideal distribution, then which number is engraved on which face doesn’t matter. So this ideal arrangement of numbers doesn’t do much to help the accuracy of an already highly accurate die. On the other hand, this arrangement of faces might ironically be highly useful to a company making lower quality lower accuracy dice.

So to start our case study, let’s look at what one ideally would do for a project like this:

Consider Purpose: The first thing you would do is decide what you want to accomplish with your testing. Do you want to test your own dice for table fairness? Do you want to make an aggressive test of your dice to really assess their accuracy? Do you want to be really certain about your results or are you okay with some wiggle room? Do you want to test your dice against a competitor’s dice? How much time and effort do you want to put into this? All these things are going to impact decisions you make later.
Design/Choose Tests: Once you have a good idea of what you want to do, it’s time to start designing your analysis. You’ll need to decide on how you’re collecting data, how much data you’re collecting and what tests you’re going to perform. Typically at this point you’re also choosing your threshold for statistical significance. It’s probably easiest to choose these elements in this order:
- Tests/Questions: Every test is based on a pair of Hypotheses. One of these, called H0 or null hypothesis is a default assumption and always includes an equal sign. The math of the test assumes this hypothesis is true and the results of the test are the likelihood that one would observe a sample as unusual as the data that was collected, if that hypothesis were true. If we see a really rare event, then we have evidence our assumption was wrong and we reject H0. The typical tests for dice are:
  - Chi-Square Goodness of Fit Test: This is a test of the hypothesis that the distribution of the die you are testing is equal to the ideal distribution for a die of that type. So, for example, for a D20, H0 is “The probability of rolling each face of the die is 1/20” and observing a rare event rejects that hypothesis.
  - Chi-Square Test of Homogeneity: This is a test that the distribution of two or more dice (or other phenomenon) are the same. So for a relevant example, you would use this test to check if the distribution of a precision machined metal D4 is the same as the distribution of a low impact plastic cast tumble painted D4. Your H0 would be “The probability distribution for all of the dice in this test are the same.” and observing a rare event rejects that hypothesis.
    Why don’t we just test each die with the Goodness of Fit test and see which one tests better? Well, imagine a situation where your high quality expensive precision D4 has a random string of unusual rolls. Maybe you roll a bunch more 4s than you would expect. At the same time your low quality cheap inaccurate D4 has a string of really good rolls. If you look at those samples together it might look like your low quality die was better than your high quality die. But really, all you did was observe two rare samples back to back. That happens from time to time, but two separate goodness of fit tests have no way of determining if that might be the case. The test of homogeneity on the other hand takes into account the variance of the dice to help prevent these incorrect assessments. Remember that when you perform a test of your dice you’re only selecting one possible set of rolls of that size and set of results all range from very likely to very unlikely. The only way you know which set you have observed is to observe multiple samples or observe large enough samples. A single small to medium sized sample is an indication of what your true value may look like but isn’t always consistent.
    Note that the test of homogeneity only tests “H0: All of these dice are distributed exactly the same.” and rejecting that hypothesis only concludes that there is evidence that at least one of them is different than the others. Unless you are only testing two dice against one another, the test makes no claim about which ones are different from each other and the magnitude of those differences. Those follow up questions will require additional tests of their own and you should have those tests planned and accounted for before you run the first test. If the original test that tests all dice against each other finds no significance, there’s no need to actually run any of the planned follow up tests since there’s no evidence of individual differences to find if the first test isn’t significant. We’ll see an example of this later.
  - Chi-Square Test of Independence: Sometimes you’ll find explanation of this test alongside the Test of Homogeneity. If you’re saying to yourself “Huh. The statement ‘These dice all have homogenous distributions’ and the statement ‘the distribution of these die rolls is independent of which die they came from’ sure sound exactly the same.” congrats. That’s an astute observation. These tests are really poorly named. In general you almost never use the test of independence for die testing, but here are a few facts that will help make up your mind:
    - Mathematically, the tests of Homogeneity and Independence are identical, so if you’ve accidentally done the wrong test, no big deal. You’ve also done the right one.
    - The Test of Homogeneity tests if two different samples have the same distribution. Since they’re two different samples, they don’t have to have the same sample size. You can roll your first die 500 times and your second die 5000 times. It’ll make any attempt to do power calculations (the math to see how often you detect differences if they exist) weird but you can do it. Which means you can in theory trawl the internet picking up random D20 data sets from various manufacturers and mush them into one giant test of homogeneity.
    - The Test of Independence on the other hand tests multiple factors collected from a single sample. For example you might want to test if suit and number in a deck of cards are independent or if the temperature at which dice are rolled impact distribution, or some other combination of factors. Unlike Homogeneity, with this test every time you do a test it generates a set of data with a measurement of each factor of interest. Thus it would be really hard to have different sample sizes for different factors.
  - Other Tests: Those are the most common tests that are performed in dice evaluation but others might be used if you’re trying to answer questions other than “How well does my die match the ideal distribution?” or “Is one of these dice better than another?” In fact, you can make up your own tests as long as you can define a hypothesis and then are able to figure out the math for: Assuming the hypothesis is true, what is the probability we observe a sample as or more rare than the sample we actually observed?
- Threshold for Significance: A test is said to be “Significant” if we assume that H0 is true and, if that is the case our collected data is too rare. What is “too rare”? Well, that’s up to you. You can set this threshold wherever you want, but there are some factors to consider:
  - Pick Before You Test: You should run no risk of your collected data influencing where you place your threshold for significance. If your test results show an observed rarity of 2%, do you want to wait till that moment to decide if 5% or 1% is your threshold for too rare?
  - Use One Threshold: Once you’ve chosen a threshold, that should be the threshold for your entire set of tests unless there’s a good reason to change from test to test (we’ll discuss that shortly). Without excellent justification, changing thresholds, especially if your data is borderline and you change in such a way to find results that are favorable to you, smacks of dishonesty and an attempt to manipulate results in your favor.
  - Use Common Standards: The standard thresholds are .1 (10%), .05 (5%), or .01 (1%). It’s possible to use other thresholds, nothing’s stopping you. However, larger thresholds come with larger chances of making an error (we’ll discuss that in a moment) and start detecting common results as “too rare”. Remember that rolling a 1 on a d10 for example has a 10% chance of occurring. Does it really make sense to say that an event more common than a standard d10 roll is so rare that observing them must mean your assumptions are wrong? On the other end of the spectrum, there are applications where thresholds smaller than .01 are appropriate. These are commonly seen in applications where lives or large sums of money may be on the line such as medical research or insurance calculations. In general, die testing doesn’t warrant this level of accuracy unless you have extraordinary amounts of money riding on them. I haven’t even heard of casinos testing their dice to levels of accuracy tighter than .01 but it wouldn’t surprise me if they did.
  - Consider Family-Wise Error Rate: Let’s say you decide your threshold for significance in a test of goodness of fit is .05 which is the same as 1 in 20. That means that if you assume your die has the ideal distribution and your set of rolls are as or rarer than 1 in 20, you’ll say the results are significant and reject that the die is fair. Now let’s say you have 100 identical ideally distributed dice and you test each of them, rejecting each die for which you see a sample with 1 in 20 rarity. How many of those 100 tests are you going to reject, on average? You’d expect to see an event of 1 in 20 rarity about 1 20th of the time, so in our theoretical 100 tests of fair dice, you would expect to reject 5 of them specifically because they follow the expected distribution. These are called Type 1 errors. When you choose your threshold of significance, another way of thinking about it is that your threshold is the proportion of time you’re willing to make these Type 1 errors. But if you’re performing more than one test at a given threshold then as in our example above, your chance of making at least one Type 1 error increases over your desired threshold as the number of tests increases. This increased chance of error is called your Family-Wise Error Rate and you can control for it in a number of ways. One such way is mathematically calculating a new threshold for individual tests that result in your overall Family-Wise Error Rate being equal to your original desired threshold. Another is using step by step procedures to reject individual tests and fail to reject others based on p-values and other factors. Discussion of individual methods is beyond our scope here other than to say: be careful when deciding if you can assume individual tests are independent or not. Explanation of a handful of procedures can be found here.
    To determine if you need to worry about Family-Wise Error Rate, ask yourself if any of your tests are related groups of tests and if increasing the number of tests will increase the chance of accidental error.
- Sample Size: Now that you know what kind of tests you’ll be performing and at what level of significance, you can finally determine the size of the sample you’ll need to generate in order to properly run your tests. Remember that the more data you have, the smaller a difference you can detect with greater accuracy. Unfortunately, most of the tests you’ll use in die testing are chi-square tests and determining necessary sample sizes for chi-square tests can’t reasonably be done without software. If you’re confident in your R or Python abilities, several packages exist for those. However, a stand-alone product exists for those who don’t want to bother installing a programming language and code editor and figuring out how to code a solution themselves. G*Power has good reviews online, is a simple stand-alone product and will handle what we need. If you choose to use G*Power, once you download it and start it up, you’ll need to choose Test Family=X2, Statistical Test=Goodness of Fit Tests: Contingency Tables, and Type of Power Analysis=A Priori: Compute Required Sample Size – Given alpha, power, and effect size. Then simply enter the requested information and click calculate.
  - Effect Size: For Chi-Square tests standard effect sizes are .1=small, .3=medium, and .5=large. A smaller effect size means that you’ll be able to detect smaller differences between the distributions you’re comparing to one another.
  - Alpha Err Prob: This is your threshold for significance discussed above. Remember that if you’re dealing with Family-Wise-Error, even if you’re planning on using step procedures to deal with determining significance, you’ll still need an actual alpha here. Depending on if your tests can be said to be independent or not, the easiest option are the Bonferroni or Sidak procedure.
  - Power (1-Beta err prob): Power is the probability that you’ll be able to detect the difference of interest if it actually exists. Sampling involves a lot of random noise so even if the difference you’re looking for exists some samples you generate will make it look like it doesn’t. The higher your Power, the more samples will detect differences correctly, but also the bigger those samples will have to be. Standards include .8 (detects differences 80% of the time if they exist) and .95 (detects them 95% of the time if they exist.)
  - Df: This is the degrees of freedom of your test. For Goodness of fit tests it’s the number of faces your die has -1. For tests of homogeneity it’s (number of faces-1)(number of dice you’re comparing-1). So for example a goodness of fit test on a d20 has 19 degrees of freedom, while comparing a set of 4 different d20s together has 19*3=57 degrees of freedom.
Collect Data: Now that you know how much data you need and what you’re going to do with it, it’s finally time to start collecting data. This order is important because you neither want too little nor too much data. Too little and you have insufficient power to detect deviations. Too much data and you’re going to detect really tiny differences. That sounds great until all your dice test as not fair because you collected 15,000 rolls each. When you collect data, in general it’s easiest to simply keep a running tally of the count of each face rolled. Some people like to keep an actual record of every roll that was made. That’s fine too, although the first thing you will do with a list of rolls is tabulate it into a count of each face. Here are two potential approaches you could take with data collection. Either way is fine. You’re just measuring slightly different things:
- In one approach you could attempt to gather data in the environment that best describes your use case. Sit at your dining room table, maybe scatter some books, papers, and soda cans around as obstacles and get rolling. The idea here is that your data will do a good job representing the performance that the average home user sees.
- In the other, you can attempt to gather data in an ideal environment: attempting to optimize for bouncing, well lit for accurate reading of results, and using as close to the same force and angle for each throw as possible for consistency. This approach is an attempt to capture as close to the true distribution of faces of the die as possible. The fact that the end user will never experience these idealized results is irrelevant.
Analysis: Surprisingly there’s not much to say here. You have your planned tests. You have your parameters. You have your collected data. The analysis is just math. The math is simple enough for most Chi-Square tests that you can literally do it with pencil and paper (and if you’re like me you’d find that fun but would also make dozens of simple errors and then have to hunt them down.) In practice, this is almost exclusively done via one form of software or another. The end result of analysis is almost always a p-value and a statement about the original claim for each test, something like: “p-value=.003 Evidence exists to reject the claim that all the dice tested share the same distribution.” We use p-value for most test results because they are familiar to many people, they are fairly simple to understand, and they are standardized such that completely different tests, even different types of test can be compared easily. P-values by the way literally mean “If our H0 hypothesis is true, this is the probability of observing data as or more rare than we did.” For the final statement, keep in mind that actually proving something with statistics is almost impossible. Disproving things is sometimes just as hard. Instead most statistical results make use of phrases like “Evidence exists in support of the claim that…” or “Data suggest that…”
Presentation: Sharing your results is a straightforward process. You simply walk through this same process and explain what you did.
- Discuss what you were attempting to test. This part can be in plain language. Explaining the problem or question.
- Explain your analysis plan including tests, threshold for significance, and sample size. You may want to discuss why you designed your plan the way you did. This is also a good place to discuss common statistics concepts and definitions if you feel your audience may not be familiar with them.
- If you did anything fancy for data collection, you will probably want to share it. You may want to share graphs of simple descriptive statistics of your data. It’s also a good practice to provide your actual data set if there’s no good reason not to.
- Go over each test you performed and their results. As mentioned earlier, if possible, these should mostly be just p-values and statements of what those p-values say about your initial claims.
- At the end you may wish to close with an overview especially if your results show an important trend or pattern. Even if your results aren’t particularly impressive, a conclusion like “Results are inconclusive, further testing with larger sample sizes is needed.” can be useful as well as suggestions for follow up tests that could be performed.

So that’s the general approach for testing dice. This isn’t comprehensive. There are quite a few bits I glossed over but for our case study it’s a sufficient framework to work with.

Next week is part 2: Review of the Honest Dice Analysis.

The week after that is part 3: Suggested Analysis.

https://gnomestew.com/a-case-study-in-dice-stat-tests-part-1-general-approach/

https://gnomestew.com/?p=52899

solarbird

I’ve got a post going up tomorrow (Monday) morning, but the word you need to understand right now is:

Remigration

If you don’t already know this word – or if you’re in certain areas of academia and think you do, but do not in any context outside academia – you need to know what this word means right now. And you need to make sure your friends know what it means.

Wikipedia will explain it to you.

More tomorrow.

Posted via Solarbird{y|z|yz}, Collected.

canyonwalker

Every so often I get inspired by one of those Buzzfeed listicles, "We asked 10 bartenders what shit beer they like." Of course they don't actually call it shit beer; they use some euphemism like "cheap beer" or "traditional American style lager". Y'know, the pisswater American macrobrews like Bud and Miller, except not Bud or Miller. Usually. That's how I was inspired to try a 12-pack of Miller Hi-Life a few years ago as part of my ongoing beer tasting project. Verdict: pisswater beer was kinda... pisswatery. Well, after a few years I was motivated by another of these "shit beer that online randos say they like" listicles to try Narragansett Lager.

Trying Narragansett Lager (Jun 2024)

Those of you of a certain age who hail from Rhode Island, all maybe 200,000 of you, may object to my lumping Narragansett in with shit beers. 'Gansett is your state brew! There was a time decades ago when regional beers were more prominent— and generally they were better, if even by a little bit, than the nationally distributed swill of Bud, Miller, or what Schlitz degenerated into.

The fact that Narragansett is a regional beer from all the way across the country made me wonder if I'd be able to find it here in California. It turns out it's not so hard. My trusty regular source, Total Wine & More, keeps it in stock. I picked up a 6er of it when we were out last weekend.

The first thing of note about Narragansett is that it come in 16oz. cans. It sells for a price similar to packs of 12oz. cans, though, making it a bit of a deal right there.

The next thing I noticed, when I poured it into a glass, is that it foams into quite a head if not poured carefully. The glass pictured (right/above) is an 18oz glass, and that's not even the full can I managed to pour in before the head filled it up.

So, how does it taste? In a word: Uhh....

Uhh is because this beer starts out nasty. It has a sour, skunky taste at first sip. Fortunately that clears up quickly and the beer becomes somewhat pleasant, if bland. But still, starting out icky and then only improving to "bland" is hardly a recommendation.

Maybe I should stop trying to believe these Buzzfeed listicles that the best shit beers are any good.

sorcyress

I have two days left of school, which is probably good, and then I slam into summer with several weeks of Very Exciting. I'm going to drive to MD and back in the span of about four days. Somewhere in here I need to finish preparing for my ESCape classes. I have to pack?!?! Maybe all my stints on crew mean I'll be able to pack a lot lighter than usual for ESC/Scots (lolno).

Tomorrow I should figure out everything that needs to be done in the school building before I depart for the year. Packing up. Printing things. Recycling many many pieces of paper. Submitting grades and paperwork and the like. (The printing things is sneakily a "prepare for ESCape" thing to do --I want to have some properly formatted dances or What Have You. I seem to recall I did quite a lot of this as a work-in-company with Veronica last summer, I probably won't have the MD time to do that again, but it's worth noting that MD does have printers. And so does MA. It's not the end of the world if I fail to print things, is what I'm saying.)

I've been playing lots of Stardew Valley, which I'm quite into, even if I'm getting a little stuck on some of the plot bits. Not looking anything up is a bit of a drag, but also feels very very good when I do figure things out myself. I am glad to have realized I can go back through the library and reread the books I've found --I was worried those were one shot readings, and I'd forgotten the early ones. Anyways, no spoilers, I'm a bit into my second summer.

Went to service ringing today, for probably my last bells for a month. So that's...a lot. Luckily it did not emotionally overwhelm me, maybe partly because I've had a low-key headache for a few days now. I arrived at Advent in time to help ring down, and then we did a mostly nice extent of Cambridge at Old North. (It was only mostly nice because everybody spontaneously exploded when we got to the plain course, and we limped through and barely made it. Sigh.)

Yesterday was pride and I wrote up a bunch of thoughts and should probably post them somewhere, instead of just manually sending them to various partners who like hearing about my life. I suppose there are others of you who like hearing about my life as well or whatever.

I finally picked back up the Endless Photo Organization Project the other day, and sorted a handful of photos. I forgot to note the number at the beginning, so I have no idea how many things got tagged. It's nice to get back into the swing of the project, and extremely nice to have set the whole thing up on the external harddrive, so switching it from old computer to new was essentially trivial. I wish I could figure out a few more keyoard shortcuts to speed things up a tick, but I'm feeling pretty okay with the mousing that has to be done.

Having finished words, I should do at least one of the dishes and going to sleep. Goodnight!

~Sor

MOOP!

canyonwalker

Two years ago I bought a straw hat. It's well made and looks great. I liked it so much I signed up for the hat maker's mailing list. That was fine for two years... until recently. Recently they've been emailing me with hat updates, like every day.

And these aren't even earth-shattering "Check out this crazy new hat!" type announcements; they're "Hey, have you ever seen a plain fucking standard ballcap in black?" Like, OMG, yes, a billion times. Then the next day: "Stand out from the crowd in this gray ballcap!!1!"

WTF is with a hat company spamming me every day to buy a hat? Especially a completely generic hat? Why do companies abuse communications like this? If they were still sending me one message a month, I'd read it. Now I've unsubscribed and they've lost a customer.

Profile

canyonwalker

June 2025

S	M	T	W	T	F	S
1	2	3	4	5	6	7
8	9	10	11	12	13	14
15	16	17	18	19	20	21
22	23	24	25	26	27	28
29	30

Page Summary

xkcd_feed - Interoperability
gnomestew_feed - Collaborative Storytelling Loop
canyonwalker - Being a Slug on My Day Off
canyonwalker - NOT Leaving on a Trip Tonight
sorcyress - Life, rest, mostly Stardew
xkcd_feed - Tukey
gnomestew_feed - Gnomecast 215 – Getting Them to Play Something Else
canyonwalker - Better Call Saul 6.09: Kim Leaves
sorcyress - Summer!
solarbird - hey team north end!
canyonwalker - The Beer that Inspired a Movie
xkcd_feed - Exoplanet System
solarbird - Let’s talk “Remigration”
canyonwalker - Quiet Weekend at Home. Still Getting Over/Through Colds.
oots_feed - 1328: Facing Rules
gnomestew_feed - A Case Study In Dice Stat Tests Part 1: General Approach
solarbird - A word you need to know right now
canyonwalker - Trying Narragansett Lager
sorcyress - (no subject)
canyonwalker - How Often Do They Think I Buy a Hat?

Style Credit

Style: Neutral Good for Practicality by timeasmymeasure

Expand Cut Tags

No cut tags

Page generated Jun. 21st, 2025 03:03 am