Scroll sawers put blades to the statistical test by cutting out ducks

Years ago I helped Quality Assurance Manager John Engler solve a tricky issue at Robinson Rubber via design of experiments (DOE). He contacted me last fall to help him apply DOE to a nagging question about scroll sawing: Does it pay to buy pricier blades?

We worked together to design a simple-comparative randomized-block experiment on 10 competitive blades. John enlisted 20 fellow hobbyists in his NorthStar Scrollers club to cut out a duck from pine (see pattern below) using the selected blades (such as the one taped on the board) in a random order.

They then rated the results on a 1 to 9 scale—higher the better—for speed of cut, blade jumpiness, fuzzies (undesirable!), edge smoothness, burns and line following.
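For anyone wanting to set up a similar test, the randomized-block layout described above can be sketched in a few lines of Python. This is only an illustration, not the plan generated for the club: the function name, the blade labels 1 to 10 and the seed are all assumptions. Each scroller is a block who receives every blade in an independently shuffled order.

```python
import random

def randomized_block_plan(n_blocks, treatments, seed=None):
    """Return a dict mapping block number -> shuffled run order of all treatments."""
    rng = random.Random(seed)  # seeded so the plan can be reproduced
    plan = {}
    for block in range(1, n_blocks + 1):
        order = list(treatments)
        rng.shuffle(order)  # independent random order within each block
        plan[block] = order
    return plan

# Hypothetical labels: 10 blades numbered 1..10, 20 scrollers as blocks.
blades = list(range(1, 11))
plan = randomized_block_plan(n_blocks=20, treatments=blades, seed=2024)

for scroller in (1, 2, 3):
    print(f"Scroller {scroller}: {plan[scroller]}")
```

Randomizing within each block guards against order effects such as fatigue or a warming-up saw, while the blocking itself lets the analysis subtract out scroller-to-scroller differences.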

Scroll saw ready to cut out a duck

The blades differed significantly on all attributes at p < 0.0001 other than line following (p = 0.3419). For the most critical measure, speed of cut, blades 3, 8 and 9 stood above all others on average.

The power of doing 21 replicates—widely spread as indicated by the red dots—and, furthermore, blocking out the scroller-to-scroller differences, is seen by the narrowness of the least significant differences (based on a p of 0.05).

Accounting for all the attributes via Stat-Ease software’s multiple response optimization, these three blades held up overall, with number 3 being the winner because it costs less than the other two.

After I reported my findings to the group, John laid out a number of mitigating factors:

  • Experience of the scroll sawers
  • Type of wood, e.g., something a lot harder
  • The life of the blades (important to consider for the cost)

But all in all, this planned experiment proved to be a big hit with the NorthStar Scroller hobbyists. What impressed me was their depth of knowledge on scroll-saw blades and their explanations of why we observed such significant differences: the patterns and orientation of the teeth, etc. I was also struck by how some individuals could tell right away which blades worked best, even before seeing the entire set of data. This reinforces my feeling that laying out and analyzing experiments works best by combining the know-how of a DOE expert (like me) with that of subject matter experts (not me in this case, far from it!).

“This went much better for me than I thought it might and I learned some things about blades along the way. This was fun!”

–Helen (a NorthStar Scroller blade-tester)


Classic case of sensory testing snubbing off a beer snob




The feature story on sensory evaluation in the new issue of ASTM Standardization News brings back a fond memory of a rare victory over an overly smug colleague.

I developed a taste for sensory science as a young chemical engineer determined to prove that mass-produced American lagers differed only imperceptibly, with consumers being brainwashed by deceptive advertisers. This hypothesis drew strong condemnation from one of my colleagues, a chemist named Harold who dissed lesser brews such as Old Milwaukee, which he dubbed “Old Swillwaukee”.

To put this beer snob to the test, I organized a tasting at a Super Bowl party attended by a dozen or so fellow researchers. Beforehand, I engaged a sensory professional whom our employer had hired to guard against “off odors” from our manufacturing plants. She advised that we limit drinking of each beer to a small sip, then eat saltless crackers and wash them down with water before going to the next brew. Also, both the presenter of the beer and the taster should be blind to the brand, thus avoiding bias.

However, given my mission to snub a beer snob, we first rated a selection of undisguised beers—including Miller, Budweiser, Old Milwaukee and others (in those days there were no ‘craft’ brews*). Harold rated “Old Swillwaukee” dead last. That was my plan! Then we repeated the tasting with the order re-randomized, but this time not revealing the names. Harold rated Old Milwaukee at the top of his list, thus providing a Super Bowl victory for me (badly needed being a Vikings fan).

My conclusion from this experience, and from my work over the years helping food scientists improve the taste and other attributes of their products, is that it would be best to adhere to ASTM’s upcoming revision to Guidelines for the Selection and Training of Sensory Panel Members. For beer and the like, bear down on the Standard Guide for Sensory Evaluation of Beverages Containing Alcohol (E1879).

We make panelists learn chemical names. For example, isoamyl acetate is a specific compound that smells like candy banana…I make panelists drink heavy cream for mouthfeel attributes. They’re unfazed by whatever we give them anymore because it’s always weird.

– Ali Schultz, sensory manager, New Belgium Brewing Company and leader of the current revision to E1879 (“Accounting for Taste”, ASTM Standardization News, January/February 2024)

However, if you are having a party, it’s more fun to be unprofessional and ignore the mandates to sip and spit, etc. ; )

*PS: The specialty beer brewers are getting a bit out of control nowadays, IMO. For example, I just got an alert from my Stillwater, Minnesota neighborhood microbrewer Lift Bridge about their release this weekend of Taking Care of Breakfast, a “barrel aged imperial breakfast stout aged in 10-year Willet and 6-year Wild Turkey bourbon barrels, infused with peanut butter and banana chips.” This new brew comes in at 12% ABV. It may be best to go with orange juice first thing in the morning.


To bean or not to bean, that is the question for coffee

In my most recent blog post on coffee I reported that a finer grind may not always be better. Now another piece of the puzzle for producing java that jives falls into place: Spritz your beans with water.

Evidently this is not a new discovery—those who really know their coffee-making craft routinely moisturize their grind to reduce clumping. A new study reported here by New Scientist reveals the problem: static electricity. Following up on the link to the original publication, I see that the research team, led by a volcanologist (sensible considering the lightning generated by particle-laden eruptions), deployed this $3000 German-made, handcrafted machine to produce extremely uniform grinds. I will definitely buy one soon (after winning the lottery).

Another approach to better coffee takes a completely different route: create it from cells grown in bioreactors. Environmentalists like this because the demand for sun-grown beans leads to destruction of rain forests. Per this Phys.Org heads-up, a Finnish team just released a recipe to accelerate the creation of a new “coffee ecosystem.” This seems promising. But there is a problem: Though the current lab-grown concoctions contain twice as much caffeine as ever before, the level remains much lower than that in farmed beans.

Another way to avoid the problems of keeping up traditional methods for making coffee is to go to a beanless brew, such as the imitation now being rolled out by Seattle-based Atomo Coffee. Based on this January 24th report by the CBS Saturday Morning show, I would be willing to give it a try, especially given that they load up their brew with caffeine at the upper end of the normal range for real coffee. Full steam ahead!

One last idea (my caffeine levels now running low) for improving the taste of coffee is being selective about the shape and material of your cup. For example, see what the Perfect Daily Grind says about pouring your brew into a wine glass or other specialty containers.

“A drinking vessel has a significant impact on perception of flavour and aroma because it changes the way the coffee smells and tastes, as well as how you drink coffee. What’s more, our senses, feelings, and emotions also impact how we experience coffee.”

Marek Krupa, co-founder and CFO of Kruve


Being ‘bird-brained’ merits reconsideration

First off, writing this blog from my winter home in Florida, I appreciate the sensibility of snowbirds who abandon their northern climes every fall. Smart! Furthermore, studies show that avian brains, specifically those of crows and ravens (collectively known as “corvids”), can accommodate statistical thinking, a skill that many humans lack based on my experience as an educator. Researchers from the University of Tübingen worked this out via a clever experiment that required crows to assess the probability of getting a treat based on prior experience pecking at differing images.

“True statistical inference requires subjects use relative rather than absolute frequency of previously experienced events. Here, we show that crows can relate memorized reward probabilities to infer reward-maximizing decisions.”

Johnston, et al, Crows flexibly apply statistical inferences based on previous experience, Current Biology, Volume 33, Issue 15, 7 August 2023, Pages 3238-3243

This gives new meaning to the saying that “if the p-value is high, the null must fly.”


Chance discovery on random walk in Utrecht

Last week I taught a class on design of experiments to a biotech company in Leiden, Netherlands. Afterwards I spent a few days in Utrecht with some friends from Germany. Imagine my excitement (nerd alert!) when on my first walk from our hotel to the city center just a few hundred feet down the sidewalk I encountered this mural featuring a differential equation.

Not being a physicist, I did not immediately grasp the formula’s importance, nor the clue provided by the fellow high-stepping down a street. It turns out this fellow is a drunk whose walk has become random. The mural, as explained by Utrecht University, pays homage to their famous professor Leonard Ornstein who, in the early 1900s along with another physicist—George Uhlenbeck—developed an important variant of the “random walk”—a term introduced by pioneering statistician Karl Pearson. The Ornstein-Uhlenbeck process is used to derive models from “big” financial data, including inflation rates, commodity prices and stock values.
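The mural's exact formula is not reproduced here, but a common textbook form of the Ornstein-Uhlenbeck process is dX = θ(μ − X)dt + σ dW: a random walk that keeps getting pulled back toward a long-run level μ. A minimal simulation sketch follows, assuming that form and a simple Euler-Maruyama time step. The function name and all parameter values are made up for illustration.

```python
import math
import random

def simulate_ou(x0, theta, mu, sigma, dt, n_steps, seed=None):
    """Euler-Maruyama simulation of dX = theta*(mu - X)*dt + sigma*dW."""
    rng = random.Random(seed)
    x = x0
    path = [x]
    for _ in range(n_steps):
        # Brownian increment over a step of length dt has std dev sqrt(dt).
        dw = rng.gauss(0.0, 1.0) * math.sqrt(dt)
        x = x + theta * (mu - x) * dt + sigma * dw
        path.append(x)
    return path

# Illustrative values only: start far from mu and watch the pull toward it.
path = simulate_ou(x0=5.0, theta=1.5, mu=0.0, sigma=0.5, dt=0.01, n_steps=1000, seed=1)
print(f"start = {path[0]:.2f}, end = {path[-1]:.2f}")
```

Setting sigma to zero removes the randomness and leaves a smooth decay toward mu, which is a handy check that the mean-reverting term is doing its job.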

I did not expect to gain an education on a vacation expedition.

Very cool!

PS: I thought about asking my colleague Martin Bezener, a PhD statistician, for his opinion on the chances of coming across something so relevant to our mission at Stat-Ease while on a random walk. But I will not bother, because I already know what he would say: “One-hundred percent: It already happened”.


British system of messed up measures hilariously skewered

Upon graduation as a chemical engineer in 1975 I took a job as a process developer at a California oil company. There I learned that a barrel amounted to 42 gallons—not the 55 in the drums at my previous employer—a specialty chemical company. In the wacky British system of volumes, the number of gallons in a barrel depends on the material—31 for beer, 53 for rum (yo ho ho!), 60 for wine, etc. Their weights and distances are just as unfathomable (pun intended).

In that same year of 1975, when I first became employed as a degreed engineer, President Gerald Ford signed the Metric Conversion Act, which went nowhere; the US Metric Board it created was abolished in 1982 under President Reagan. Having endured all this measurement mess throughout my career, I thoroughly enjoyed this October 28 Saturday Night Live skit:

I raise my US pint (16 fluid ounces) of beer to SNL’s clever comedy writers, though a British pint (20 fluid ounces) would be more filling and a liter even sweeter. Let’s not get into US versus Imperial ounces (or gallons)—that would get us over our head by at least a fathom (equivalent to 4 cubits, by the way).
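For those keeping score at home, the arithmetic behind these pints, barrels and fathoms can be sketched as below. The fluid-ounce factors are the standard definitions in milliliters; the 18-inch cubit is an assumption (the unit varied historically), and the variable names are just for illustration.

```python
# Standard definitions of the two fluid ounces, in milliliters.
US_FL_OZ_ML = 29.5735295625
IMP_FL_OZ_ML = 28.4130625

us_pint_ml = 16 * US_FL_OZ_ML    # US pint = 16 US fluid ounces
imp_pint_ml = 20 * IMP_FL_OZ_ML  # British pint = 20 Imperial fluid ounces

# Oil barrel = 42 US gallons, each gallon being 128 US fluid ounces.
oil_barrel_l = 42 * 128 * US_FL_OZ_ML / 1000

# Fathom = 6 feet = 72 inches; cubit taken as 18 inches (an assumption).
INCHES_PER_FATHOM = 72
INCHES_PER_CUBIT = 18
cubits_per_fathom = INCHES_PER_FATHOM // INCHES_PER_CUBIT

print(f"US pint:       {us_pint_ml:.1f} mL")
print(f"British pint:  {imp_pint_ml:.1f} mL")
print(f"Pint ratio:    {imp_pint_ml / us_pint_ml:.2f}")
print(f"Oil barrel:    {oil_barrel_l:.1f} L")
print(f"Cubits/fathom: {cubits_per_fathom}")
```

Note that the British pint comes out only about 20 percent larger than the US pint, not 25 percent, because the Imperial fluid ounce is slightly smaller than the US one.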

PS. This rant about measures brings me full circle to an outburst at the outset of this year involving a whimsical unit of distance called the ‘smoot’.


Swedish sleep researchers torture subjects with math problems

This is alarming news, literally: Researchers from Stockholm University discovered via studies involving over 1700 subjects* that over two-thirds of them, especially younger individuals, habitually hit the snooze button.

I am appalled at this lack of discipline and ambition! However, I must confess that in my younger days, I got in the habit of putting my alarm on temporary pause repeatedly, which often caused me to run late for class. That would not do! Therefore, I purchased a cleverly built clock called the Clocky that rolls away when ringing, thus forcing you to jump out of bed to hunt it down. Highly recommended!

Putting aside my negative attitude about snoozers, I do feel bad for those subjected to the sleep study because as reported by the New York Times: “Immediately after the participants woke up, the researchers flipped on the lights and presented them with math problems and other cognitive tests — a challenge even more grating than a shrieking alarm, and one the participants had to complete before having a cup of coffee.”** Oof!

The good news for you slackers who do not leap out of bed like I do is that this new study provides a pass for delaying the inevitable: “Snoozing [for 30 minutes] does not lead to cognitive impairments upon waking.” Just do not sleep through your final exam on math. That would be a nightmare!

*Is snoozing losing? Why intermittent morning alarms are used and how they affect sleep, cognition, cortisol, and mood, Journal of Sleep Research, October 17, 2023.

**“You Snooze, You … Win?”, Dani Blum, Oct. 18, 2023.


Variation in eggs presents perplexing problems for preparation

Today is World Egg Day.

I’m a big fan of eggs—my favorite being ones perfectly poached in an Endurance Stainless Steel Pan. However, the eggs that come from my daughters’ hens vary in size far more per container than store-bought, graded ones. I work around this by adding or subtracting time based on my experience. I really should weigh the eggs and design an experiment to optimize the time.

Coincidentally, I just received the new issue of Chance, published by the American Statistical Association. An article titled “A Physicist and a Statistician Walk into a Bar” caught my eye because one of my Stat-Ease consulting colleagues is a physicist and another is a statistician. I was hoping for a good joke at both of their expense. However, the authors (John Durso and Howard Wainer) go in a completely different direction with an amusing, but educational, story about a hypothetical optimization of soft-boiled eggs.

The problem is that recipes suffer from the “flaw of averages”: smaller eggs get undercooked and bigger ones end up overcooked unless the time gets adjusted (as I well know!).

While the physicist sits over a pint of beer and pad of paper scratching out possible solutions based on partial differential equations related to spheroidal geometry, the statistician assesses data collected on weights versus cooking time. Things get a bit mathematical at this point* (this is an ASA publication, after all) but in the end the statistician determines that weight versus cooking time can be approximated by a quadratic model, which makes sense to the physicist based on the geometry and makeup of an egg.
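To make the idea of a quadratic model concrete, here is a minimal least-squares sketch in plain Python. It is not the authors' analysis and not Stat-Ease software output: the function names are hypothetical and the weights and times are invented purely for illustration. Weight is centered on its mean before squaring, which keeps the arithmetic well behaved.

```python
def fit_quadratic_centered(xs, ys):
    """Least-squares fit of y = b0 + b1*z + b2*z**2 with z = x - mean(xs).

    Needs at least three distinct x values. Returns (center, [b0, b1, b2]).
    """
    center = sum(xs) / len(xs)
    zs = [x - center for x in xs]
    # Power sums give the 3x3 normal equations, stored as an augmented matrix.
    s = [sum(z ** k for z in zs) for k in range(5)]
    t = [sum(y * z ** k for z, y in zip(zs, ys)) for k in range(3)]
    a = [[s[i + j] for j in range(3)] + [t[i]] for i in range(3)]
    # Gaussian elimination with partial pivoting.
    for col in range(3):
        pivot = max(range(col, 3), key=lambda r: abs(a[r][col]))
        a[col], a[pivot] = a[pivot], a[col]
        for r in range(col + 1, 3):
            factor = a[r][col] / a[col][col]
            for c in range(col, 4):
                a[r][c] -= factor * a[col][c]
    # Back substitution.
    b = [0.0, 0.0, 0.0]
    for i in (2, 1, 0):
        b[i] = (a[i][3] - sum(a[i][j] * b[j] for j in range(i + 1, 3))) / a[i][i]
    return center, b

def predict(center, b, x):
    """Predicted response at x from the centered quadratic."""
    z = x - center
    return b[0] + b[1] * z + b[2] * z * z

# Invented numbers for illustration only (not the published or blog data).
weights_g = [48, 52, 55, 58, 61, 64, 67, 70]
times_s = [262, 275, 290, 301, 318, 331, 349, 366]

center, coeffs = fit_quadratic_centered(weights_g, times_s)
print(f"Predicted time for a 60 g egg: {predict(center, coeffs, 60):.0f} s")
```

A statistics package would add what this sketch leaves out, namely significance tests, diagnostics and confidence bands around the fitted curve.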

I took some liberties with the data to simplify things by reducing the number of experimental runs from 41 to 8. Also, based on my experience cooking eggs of varying weights, I increased the variation to a more realistic level. See my hypothetical quadratic fit below in a confidence-banded graph produced by Stat-Ease software.

Perhaps someday I may build up enough steam to weigh every egg, time the poaching and measure the runniness of the resulting yolks. However, for now I just eat them as they are after being cooked by my assessment of the individual egg-size relative to others in the carton. With some pepper and salt and a piece of toast to soak up any leftover yolk, my poached eggs always hit the spot.

*For example, they apply Tukey’s ladder of variable transformations – a method that works well on single-factor fits and can be related to the shape of the curve being concave or convex, going up or down the powers, respectively. It relates closely to the more versatile Box-Cox plot provided by Stat-Ease software. Using the same data as Durso and Wainer presented, I found that the Box-Cox plot recommended the same transformation as Tukey’s ladder.


Data detectives keep science honest

An article in the Wall Street Journal last week* drew my attention to a growing number of scientists who moonlight as data detectives sleuthing out fraudulent studies. Thanks to their work, the number of faulty papers retracted increased from 119 in 2002 to 5,500 last year. These statistics come from Retraction Watch, which provides a better, graphical perspective on the increase based on percent retractions per annual science and engineering (SE) publication: not nearly as dramatic given the explosion in publications over the last 20 years, but still very alarming.

“If you take the sleuths out of the equation it’s very difficult to see how most of these retractions would have happened.”

Ivan Oransky, co-founder of Retraction Watch, a blog that reports on retractions of scientific papers.

Coincidentally, I just received this new cartoon from Professor Nadeem Irfan Bukhari. (See my all-time favorite from him in the April 27, 2007 StatsMadeEasy blog Cartoon quantifies commitment issue.)

It depicts statistics as the proverbial camel allowed to put its nose in the tent occupied by science disciplines until it becomes completely entrenched.

Thank goodness for scientists like Nadeem who embrace statistical tools for design and analysis of experiments. And kudos to those who guard against faulty or outright fraudulent scientific publications.

*The Band of Debunkers Busting Bad Scientists, Nidhi Subbaraman, 9/24/23


Temperature combines badly with humidity to maximize misery

The Twin Cities tied its record high temperature yesterday at 97 degrees Fahrenheit. However, the winds blew strong with air at a dew point in the low 60s, which made the heat relatively tolerable. After spending most of August at our second home in southwest Florida (leaving there just as Hurricane Idalia hit), my wife and I got acclimated to a far more uncomfortable daily combination of heat and humidity.

Before departing for Minnesota, I set up a SensorPush to monitor temperature, humidity and dew point—the temperature at which air becomes saturated with water vapor. I want to be on guard for the air conditioning going out. If that happens in Florida homes, mold can grow. After experiencing this once (due to renters not running the A/C) and dealing with an expensive remediation, I am keen to prevent another episode.
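Dew point can be estimated from air temperature and relative humidity using the Magnus approximation. The sketch below assumes one commonly quoted pair of Magnus constants (17.62 and 243.12 °C); whether SensorPush uses this formula is not known, and the function names and the 30% humidity reading are hypothetical.

```python
import math

# One commonly quoted pair of Magnus constants (an assumption; others exist).
MAGNUS_A = 17.62
MAGNUS_B = 243.12  # degrees Celsius

def dew_point_c(temp_c, rh_percent):
    """Approximate dew point (deg C) from temperature (deg C) and RH (%)."""
    gamma = math.log(rh_percent / 100.0) + MAGNUS_A * temp_c / (MAGNUS_B + temp_c)
    return MAGNUS_B * gamma / (MAGNUS_A - gamma)

def f_to_c(temp_f):
    return (temp_f - 32.0) * 5.0 / 9.0

def c_to_f(temp_c):
    return temp_c * 9.0 / 5.0 + 32.0

def dew_point_f(temp_f, rh_percent):
    """Same calculation with Fahrenheit in and out."""
    return c_to_f(dew_point_c(f_to_c(temp_f), rh_percent))

# Hypothetical reading: a 97 degree F day at 30% relative humidity.
print(f"Dew point: {dew_point_f(97.0, 30.0):.1f} F")
```

At 100% relative humidity the formula returns the air temperature itself, which matches the definition of dew point as the temperature at which the air is saturated.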

Closely related to dew point is the wet-bulb temperature, which, as a chemical engineer, I learned how to measure with a sling psychrometer. The wet-bulb temperature, together with the dry-bulb reading, can then be converted to relative humidity.

To prevent heat-related deaths in training camps, the US military developed a more sophisticated measure called the wet bulb globe temperature (WBGT). It provides a good measure for the advisability of being active in full sun. The Texas University Interscholastic League requires that outdoor practices be shut down if WBGT exceeds 92.

“As with all indices that integrate elements of the thermal environment, interpretation of the observed levels of WBGT requires careful evaluation of people’s activity, clothing, and many other factors, all of which can introduce large errors into any predictions of adverse effects.”

– Grahame M Budd, Wet-bulb globe temperature (WBGT)–its history and its limitations
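For reference, the usual outdoor (full sun) WBGT is a weighted average of three readings: 70% natural wet-bulb temperature, 20% black-globe temperature and 10% dry-bulb air temperature. A minimal sketch follows, using the shutdown limit of 92 quoted above. The function names and the example readings are hypothetical, and all three inputs must be in the same temperature units.

```python
def wbgt_outdoor(natural_wet_bulb, globe, dry_bulb):
    """Outdoor WBGT: 0.7*wet-bulb + 0.2*globe + 0.1*dry-bulb (same units)."""
    return 0.7 * natural_wet_bulb + 0.2 * globe + 0.1 * dry_bulb

def practice_allowed(wbgt, limit=92.0):
    """True unless WBGT exceeds the limit (92 per the rule cited above)."""
    return wbgt <= limit

# Hypothetical readings in degrees Fahrenheit.
reading = wbgt_outdoor(natural_wet_bulb=80.0, globe=120.0, dry_bulb=95.0)
print(f"WBGT = {reading:.1f} F, practice allowed: {practice_allowed(reading)}")
```

The heavy weight on the wet-bulb term is why humidity dominates the index, while the globe term captures radiant heat from the sun.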

Other measures used to gauge comfort are Heat Index and Feels Like Temperature (FLT). I like the FLT because it accounts for the benefits of evaporative cooling. For example, as I write this, the actual temperature is 95 degrees and the FLT is only slightly higher at 96.

I’m getting too hot and bothered with all these measurements to continue much longer, but here’s yet another approach used by AccuWeather—the RealFeel Temperature.

What really matters is how you feel and what can be done to avoid discomfort. For example, earlier this summer I went to Minnesota’s Washington County Fair on a very hot day and stopped in at a beer garden for a cold brew. However, I soon realized that its hot tin roof radiated heat down to the picnic tables, overcoming any advantage of being in the shade.

Sometimes you can find no relief other than hunkering down in an air-conditioned area. How did we ever get by without it?
