Your voice will unlock the magic of XR

Should you ever be in any doubt over the truth of Arthur C. Clarke’s Third Law (the one that states that any sufficiently advanced technology is indistinguishable from magic), just try a HoloLens 2 experience where you can tell holograms what to do.

Anyone who manages to walk away from that without feeling a little bit like a wizard or witch casting a successful spell is likely leading a fairly jaded and joyless existence. There is something fundamentally satisfying, almost primal, about watching the world change as you command it to. Like lucid dreaming.

Yet even the naysayers can probably agree that voice-based technologies driven by artificial intelligence represent a huge market opportunity. Millions of us now own some sort of room-based smart home device such as Google Home or Amazon Echo, and the global voice and speech recognition market size is estimated to reach nearly $32 billion USD by 2025. We are witnessing a convergence of various emerging technologies in that space – such as voice recognition, natural language processing, and machine learning – powered by 5G connectivity and the AR cloud.

In this screenless world of spatial computing, interfaces will need to become more intuitive, efficient, and empathic.

Immersive experiences like those afforded by the HoloLens offer a tantalizing glimpse of where this is all headed: a screenless world where digital interfaces become a part of natural human interactions, creating an entirely new form of hybrid – or extended – reality. In fact, Gartner predicts that this year, 30 percent of web browsing will be done without a screen.

The next technology revolution will usher in the era of spatial computing, where multisensory experiences allow us to interact with both the real and digital worlds through natural, intuitive interfaces such as haptics, limb and eye tracking, and even elements such as taste and scent.

In this screenless world of spatial computing, interfaces will need to become more intuitive, efficient, and empathic. Let’s take a look at three ways in which voice technologies are already enabling this.

Intuitive UX

A woman examining graphical, numerical and written data on a virtual screen that appears like a 2D hologram

Spatial audio and AI-driven voice technologies are crucial elements for creating compelling immersive experiences. As Kai Havukainen, Head of Product at Nokia Technologies explained in an interview for Scientific American, “Building a dynamic soundscape is essential for virtual experiences to really engender a sense of presence.” Humans, he added, are simply hardwired to pay attention to sound and instinctively use it to map their surroundings, find points of interest and assess potential danger.

There are, however, design considerations that must be taken into account when tackling the challenges of an entirely new medium together with fast-evolving technologies.

Tim Stutts, Interaction Design Lead at Magic Leap, highlights the sheer complexity of these UX challenges, “A level of complexity is added with voice commands, as the notion of directionality becomes abstract—the cursor for voice is effectively the underlying AI used to determine the intent of a statement, then relate it back to objects, apps and system functions.”

“For voice experiences, you need to have a natural language interface that performs well enough to understand different accents, dialects, and languages,” adds Mark Asher, director of corporate strategy at Adobe, who believes the advancement of voice technologies will serve to “bring the humanity back to computing,”

There are still many hurdles to overcome before we reach that utopian vision of Star Trek’s universal translator, however. As we move towards more pervasive and complex experiences where users have multiple applications open at the same time, they will need to circumvent problems such as unintentionally commanding a hologram when you’re actually talking to the person next to you.

Yet looking at the exponential way AI technologies have developed over the past decade, it isn’t unreasonable to extrapolate that the next few years we will usher in real-time contextual applications that accurately identify and action commands based on accurate assessments of your surroundings (both real and virtual), your personal preferences, and even your biofeedback.

Voice biofeedback

Extended reality (XR) technologies already deploy a multitude of sensors that enable the collection of biofeedback, yet voice provides a rich vein of data that can be collected without the need for cumbersome wearables.

Apart from deliberately using commands to interact with the world around us, our voices provide the scope for AI to contextualize our XR experiences based on subconscious factors such as our mood and physical health. Cymatics – the name given to the process of visualizing soundwaves – gives us some insight into the depth and complexity of the unique patterns projected by our voice.

To produce speech, the brain communicates with the Vagus Nerve and sends a signal to the larynx, which vibrates out stored information through the vocal cords. Since vocalization is entirely integrated within both our central (CNS) and autonomic nervous system (ANS), there is an established correlation between voice output and the impact of stress.

Researchers have been developing methods for voice stress analysis (VSA) and computerized stress detection and body scanning devices for many years. Companies such as Insight Health Apps already leverage this rich data to feed corrective waveforms and patterns back into the body in the form of “quantum biofeedback”.

Bridging the Uncanny Valley

When I was first invited to test the social VR platform Sansar, I was shown around some of its virtual worlds by Linden Lab’s CEO Ebbe Altberg. To this day, my lasting impression of that demo was how our interaction felt very natural in spite of us being 5,000 miles and several time zones apart (I was in London and he in San Francisco) and the fact that his avatar looked nothing like his real-world persona.

Speech Graphics</a> developed the technology that creates this notoriously difficult-to-achieve illusion that an animated face is the source of the sound you hear. Their pipeline merges powerful speech analysis with procedural animation techniques. To achieve this, the algorithm replicates not only the movement of the lips but also decodes from that speech the energy and emotion of the speaker.</p> <figure class="wp-block-pullquote" readability="1.5"> <blockquote class="has-text-color" readability="31"> <p>Technology will soon be sufficiently advanced so that it will become an invisible layer of our reality </p> </blockquote> </figure> <p>“In the sound of speech, there is a wealth of information about what the speaker was doing when he or she made the sound—including the movements of the mouth as it produced the sound, and the energetic state of the speaker, from which we can deduce facial expression. From syllables to scowls,” its website reads. And because it is a universal physical model, it works for any language and any type of character, from realistic humans to cartoon-like avatars. </p> <h3>The future of voice</h3> <p>As digital experiences move beyond the familiar constraints of screens, our modes of interaction with the digital world are also evolving. Paradoxically, that evolution is taking us back to basic and instinctual forms of natural human interaction, hence the enduring relevance of Clarke’s Law. Technology will soon be sufficiently advanced so that it will become an invisible layer of our reality rather than a separate realm requiring special skills to access. And in that hybrid reality, we will experience an entirely new type of magic. </p> <p> <a href="https://www.futurithmic.com/2020/02/28/your-voice-will-unlock-magic-of-xr/">Source</a></p> </div> <div class="et_post_meta_wrapper"> <section id="comment-wrap"> <div id="comment-section" class="nocomments"> </div> <div id="respond" class="comment-respond"> <h3 id="reply-title" class="comment-reply-title"><span>Submit a Comment</span> <small><a rel="nofollow" id="cancel-comment-reply-link" href="/your-voice-will-unlock-the-magic-of-xr/#respond" style="display:none;">Cancel reply</a></small></h3><form action="https://netsmiami.com/wp-comments-post.php" method="post" id="commentform" class="comment-form"><p class="comment-notes"><span id="email-notes">Your email address will not be published.</span> <span class="required-field-message">Required fields are marked <span class="required">*</span></span></p><p class="comment-form-comment"><label for="comment">Comment <span class="required">*</span></label> <textarea id="comment" name="comment" cols="45" rows="8" maxlength="65525" required="required"></textarea></p><p class="comment-form-author"><label for="author">Name <span class="required">*</span></label> <input id="author" name="author" type="text" value="" size="30" maxlength="245" autocomplete="name" required="required" /></p> <p class="comment-form-email"><label for="email">Email <span class="required">*</span></label> <input id="email" name="email" type="text" value="" size="30" maxlength="100" aria-describedby="email-notes" autocomplete="email" required="required" /></p> <p class="comment-form-url"><label for="url">Website</label> <input id="url" name="url" type="text" value="" size="30" maxlength="200" autocomplete="url" /></p> <p class="comment-form-cookies-consent"><input id="wp-comment-cookies-consent" name="wp-comment-cookies-consent" type="checkbox" value="yes" /> <label for="wp-comment-cookies-consent">Save my name, email, and website in this browser for the next time I comment.</label></p> <p class="form-submit"><input name="submit" type="submit" id="submit" class="submit et_pb_button" value="Submit Comment" /> <input type='hidden' name='comment_post_ID' value='1302' id='comment_post_ID' /> <input type='hidden' name='comment_parent' id='comment_parent' value='0' /> </p><p style="display: none;"><input type="hidden" id="akismet_comment_nonce" name="akismet_comment_nonce" value="945074821e" /></p><p style="display: none !important;" class="akismet-fields-container" data-prefix="ak_"><label>Δ<textarea name="ak_hp_textarea" cols="45" rows="8" maxlength="100"></textarea></label><input type="hidden" id="ak_js_1" name="ak_js" value="229"/><script>document.getElementById( "ak_js_1" ).setAttribute( "value", ( new Date() ).getTime() );</script></p></form> </div> </section> </div> </article> </div> <div id="sidebar"> <div id="search-2" class="et_pb_widget widget_search"><form role="search" method="get" id="searchform" class="searchform" action="https://netsmiami.com/"> <div> <label class="screen-reader-text" for="s">Search for:</label> <input type="text" value="" name="s" id="s" /> <input type="submit" id="searchsubmit" value="Search" /> </div> </form></div> <div id="recent-posts-2" class="et_pb_widget widget_recent_entries"> <h4 class="widgettitle">Recent Posts</h4> <ul> <li> <a href="https://netsmiami.com/podcast-episode-42-b2b2x-is-everything-to-5g-with-amir-rao-of-aws/">Podcast episode 42: B2B2X is everything to 5G (with Amir Rao of AWS)</a> </li> <li> <a href="https://netsmiami.com/what-is-web-3-0-and-how-will-it-impact-your-organization/">What is Web 3.0 and how will it impact your organization?</a> </li> <li> <a href="https://netsmiami.com/podcast-episode-41-bringing-edge-computing-and-ai-together-in-a-5g-world/">Podcast episode 41: Bringing edge computing and AI together in a 5G world</a> </li> </ul> </div><div id="recent-comments-2" class="et_pb_widget widget_recent_comments"><h4 class="widgettitle">Recent Comments</h4><ul id="recentcomments"></ul></div><div id="archives-2" class="et_pb_widget widget_archive"><h4 class="widgettitle">Archives</h4> <ul> <li><a href='https://netsmiami.com/2021/04/'>April 2021</a></li> <li><a href='https://netsmiami.com/2021/03/'>March 2021</a></li> <li><a href='https://netsmiami.com/2021/02/'>February 2021</a></li> <li><a href='https://netsmiami.com/2021/01/'>January 2021</a></li> <li><a href='https://netsmiami.com/2020/12/'>December 2020</a></li> <li><a href='https://netsmiami.com/2020/11/'>November 2020</a></li> <li><a href='https://netsmiami.com/2020/10/'>October 2020</a></li> <li><a href='https://netsmiami.com/2020/09/'>September 2020</a></li> <li><a href='https://netsmiami.com/2020/08/'>August 2020</a></li> <li><a href='https://netsmiami.com/2020/07/'>July 2020</a></li> <li><a href='https://netsmiami.com/2020/06/'>June 2020</a></li> <li><a href='https://netsmiami.com/2020/05/'>May 2020</a></li> <li><a href='https://netsmiami.com/2020/04/'>April 2020</a></li> <li><a href='https://netsmiami.com/2020/03/'>March 2020</a></li> <li><a href='https://netsmiami.com/2020/02/'>February 2020</a></li> <li><a href='https://netsmiami.com/2020/01/'>January 2020</a></li> <li><a href='https://netsmiami.com/2019/12/'>December 2019</a></li> <li><a href='https://netsmiami.com/2019/11/'>November 2019</a></li> <li><a href='https://netsmiami.com/2019/10/'>October 2019</a></li> <li><a href='https://netsmiami.com/2019/09/'>September 2019</a></li> <li><a href='https://netsmiami.com/2019/08/'>August 2019</a></li> <li><a href='https://netsmiami.com/2019/07/'>July 2019</a></li> <li><a href='https://netsmiami.com/2019/06/'>June 2019</a></li> <li><a href='https://netsmiami.com/2019/05/'>May 2019</a></li> <li><a href='https://netsmiami.com/2019/04/'>April 2019</a></li> <li><a href='https://netsmiami.com/2019/03/'>March 2019</a></li> <li><a href='https://netsmiami.com/2019/02/'>February 2019</a></li> <li><a href='https://netsmiami.com/2017/04/'>April 2017</a></li> </ul> </div> </div> </div> </div> </div> <footer id="main-footer"> <div id="footer-bottom"> <div class="container clearfix"> <ul class="et-social-icons"> <li class="et-social-icon et-social-facebook"> <a href="https://www.facebook.com/netsmiami/" class="icon"> <span>Facebook</span> </a> </li> <li class="et-social-icon et-social-twitter"> <a href="https://twitter.com/Nets_Miami" class="icon"> <span>Twitter</span> </a> </li> <li class="et-social-icon et-social-instagram"> <a href="https://www.instagram.com/netsmiami/" class="icon"> <span>Instagram</span> </a> </li> <li class="et-social-icon et-social-rss"> <a href="https://netsmiami.com/feed/" class="icon"> <span>RSS</span> </a> </li> </ul><div id="footer-info"><a href="http://netsmiami.com/">Developed by ©NetsMiami 2019 </a> | <a href="http://netsmiami.com/privacy-policy-2/">Privacy Policy </a> </div> </div> </div> </footer> </div> </div>  <script type="text/javascript"> var subscribersSiteId = '8b8be954-276d-43f1-bfa4-5c984d9f0ccb'; var subscribersServiceWorkerPath = '/?firebase-messaging-sw'; </script> <script type="text/javascript" src="https://cdn.subscribers.com/assets/subscribers.js"></script>   <script type="text/javascript"> var sbiajaxurl = "https://netsmiami.com/wp-admin/admin-ajax.php"; </script> <div class='mwai-chatbot-container' data-params='{"aiName":"AI: ","userName":"User: ","guestName":"Guest:","textSend":"Send","textClear":"Clear","textInputPlaceholder":"Type your message...","textInputMaxLength":512,"textCompliance":"","startSentence":"Hi! How can I help you?","localMemory":1,"themeId":"chatgpt","window":true,"icon":"","iconText":"","iconTextDelay":1,"iconAlt":"AI Engine Chatbot","iconPosition":"bottom-right","iconBubble":"","fullscreen":"","copyButton":""}' data-system='{"botId":"default","customId":"4de20811ba14fbe1112c9772ab937b81","userData":null,"sessionId":"N\/A","restNonce":null,"contextId":1302,"pluginUrl":"https:\/\/netsmiami.com\/wp-content\/plugins\/ai-engine\/","restUrl":"https:\/\/netsmiami.com\/wp-json","stream":true,"debugMode":false,"speech_recognition":false,"speech_synthesis":false,"typewriter":false,"virtual_keyboard_fix":false,"actions":[],"blocks":[],"shortcuts":[]}' data-theme='{"type":"internal","name":"ChatGPT","themeId":"chatgpt","settings":[],"style":""}'></div> <noscript><iframe src="https://www.googletagmanager.com/ns.html?id=GTM-NDLLWGF" height="0" width="0" style="display:none;visibility:hidden">