On Personalizing “Hey Siri”

From Apple’s Machine Learning Journal, in a piece about what goes on behind the scenes on your devices when you say “Hey Siri”:

We designed the always-on “Hey Siri” detector to respond whenever anyone in the vicinity says the trigger phrase. To reduce the annoyance of false triggers, we invite the user to go through a short enrollment session. During enrollment, the user says five phrases that each begin with “Hey Siri.” We save these examples on the device.

We compare any possible new “Hey Siri” utterance with the stored examples as follows. The (second-pass) detector produces timing information that is used to convert the acoustic pattern into a fixed-length vector, by taking the average over the frames aligned to each state. A separate, specially trained DNN transforms this vector into a “speaker space” where, by design, patterns from the same speaker tend to be close, whereas patterns from different speakers tend to be further apart. We compare the distances to the reference patterns created during enrollment with another threshold to decide whether the sound that triggered the detector is likely to be “Hey Siri” spoken by the enrolled user.

This process not only reduces the probability that “Hey Siri” spoken by another person will trigger the iPhone, but also reduces the rate at which other, similar-sounding phrases trigger Siri.
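To get my head around the comparison step Apple describes, here is a rough Python sketch of it. This is not Apple's implementation. The function names, the use of cosine distance, the averaging of distances across the enrolled examples, and the 0.3 threshold are all my own assumptions for illustration. The specially trained DNN that maps vectors into "speaker space" is left out entirely, so the sketch assumes the vectors being compared have already been through that transform.

```python
import math

def fixed_length_vector(frames, state_of_frame, num_states):
    """Average the frames aligned to each state, then concatenate the averages.

    frames: list of equal-length feature lists, one per audio frame.
    state_of_frame: the state index each frame was aligned to (hypothetical
    stand-in for the detector's timing information).
    """
    dim = len(frames[0])
    sums = [[0.0] * dim for _ in range(num_states)]
    counts = [0] * num_states
    for frame, state in zip(frames, state_of_frame):
        counts[state] += 1
        for i, value in enumerate(frame):
            sums[state][i] += value
    vector = []
    for state in range(num_states):
        n = max(counts[state], 1)  # a state with no frames contributes zeros
        vector.extend(v / n for v in sums[state])
    return vector

def cosine_distance(a, b):
    """Assumed distance measure: 0.0 for same direction, 1.0 for orthogonal."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / norm if norm else 1.0

def matches_enrolled_speaker(candidate, references, threshold=0.3):
    """Compare the mean distance to the enrolled examples with a threshold.

    The threshold value is made up; a real system would tune it on data.
    """
    distances = [cosine_distance(candidate, ref) for ref in references]
    return sum(distances) / len(distances) < threshold
```

The length of the vector depends only on the number of states and the feature size, not on how long the utterance took, which is what makes the stored enrollment examples directly comparable to a new utterance.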

I found this whole thing very interesting, even though I am not experienced in the ways of machine learning. It was particularly interesting because of something that happened last week: my wife and I were sitting on the couch and I used “Hey Siri” for something. Out of curiosity, I checked to see if it had triggered her phone, and it had not. With my iPhone, iPad, and Apple Watch at the ready, I had her try to trigger my devices multiple times, with no success.

It’s neat to see what goes into helping Siri reduce the chances of these false activations. Granted, my wife is a woman with a very slight Mexican accent. The chance of false activation would be higher with another male speaker, I imagine, but the fact that Siri is able to store and use the enrollment examples to cut down on this is still really cool.

Learn Ulysses: Here It Is (The Sweet Setup)

You deserve to be free to focus on your ideas, your writing, your notes, and your research. That’s why I use Ulysses, and that’s why I want to help you learn Ulysses and discover everything it’s capable of doing.

The Sweet Setup has launched a new course for learning Ulysses, the powerful dual-platform (iOS and macOS) text editor from developers The Soulmen. I haven’t had a chance to check out the course yet, but it sounds good and has some good reviews from those who have. Learn Ulysses consists of seven videos, which seem to cover everything there is to know about using Ulysses.

Ulysses has been the subject of some discussion lately because of its recent switch to a subscription pricing model. Some folks might not be able to justify the cost given how they use Ulysses, while others will know that they use it too much not to purchase a subscription. Personally, I took advantage of the annual subscription’s discount for existing users. I have no issue with the decision to switch, as long as it enables the developers to continue to provide the excellent quality Ulysses customers have come to know and expect in their favorite writing app.

If you recently purchased a subscription for Ulysses, or you’re considering whether it will be worth it for you, Learn Ulysses might help you make that decision or confirm that you made the right one. It does so by showing you everything you can do with the app, and it may even give you some ideas on ways to take your use further. Like I said, I haven’t seen it, and this isn’t a review. However, I like and trust the work of Shawn Blanc and the others at The Sweet Setup.

If you’re interested in this course, it’s usually $29, but with the launch-week discount of 20% you can grab it for $23 now.