APPLE

Apple's "Hey Siri" team just spilled what's coming next
Apple has pulled the curtain on Siri, and specifically the "Hey Siri" voice trigger feature that, from iPhone 6 on, allowed the assistant to recognize only its owner. In a new article published on Apple's Machine Learning Journal, the Siri team details a paper it submitted to the International Conference on Acoustics, Speech, and Signal Processing (ICASSP) which kicks off today.Turns out, differentiating between users on a low-power core that can be always-listening even with the iPhone in standby is trickier than you might expect. The "Hey Siri" phrase itself came with both positives and negatives attached, too. On the one hand, the team points out, users were already familiar with the phrasing: indeed, some were already saying it even when manually triggering Siri using the home button.However, it's also a fairly short phrase. That gives Siri very little data from which it needs to pull out all the information required to recognize both definite intent and that's the correct user asking.While Siri has an explicit training period when the assistant is first set up, those five sample phrases the user is asked to repeat do have some drawbacks. For example, there is very little "environmental variability" in the Siri team points out. In more typical use the real-world situations are seldom ideal for a clean recognition.
As with earlier machine learning insights from Apple’s journal, the detail itself might be a little too rich if you’re not into AI and audio recognition. The project was set up in mid-2017, as part of Apple’s new-found openness about some of the research projects going on inside the traditionally clandestine company. If that also encourages new engineers to apply to join the firm’s machine learning divisions, all the better.
As for what comes next, the Siri team has a few suggestions. On the one hand, there are obvious places where Siri – like all voice recognition systems – still struggles with speaker clarity. That includes both larger spaces, where reverberations become an issue, and noisy environments such as strong winds or while in the car.
However, another goal is to do away with the training requirement altogether. “Looking even further ahead,” the team writes, “we imagine a future without any explicit enrollment step in which users simply begin using the “Hey Siri” feature from an empty profile, which then grows and updates organically as more “Hey Siri” requests come in.”
Chris Davies
No comments:
Post a Comment