Co-op Researcher – Multimodal & Voice Interaction (HCI), Markham
Markham, Canada
Last edited: yesterday
Description
Huawei Canada has an immediate co-op opening for a Researcher.
About the team
The Human-Machine Interaction Lab brings together global talent to redefine the relationship between humans and technology. Focused on innovation and user-centered design, the lab strives to advance human-computer interaction research. Our team includes researchers, engineers, and designers who collaborate across disciplines on novel interactive systems, sensing technologies, wearable and IoT systems, human factors, computer vision, and multimodal interfaces. Through high-impact products and cutting-edge research, we aim to improve how users experience and interact with technology.
About the job
Prototype Implementation: Build simple prototypes (applications or small services) that demonstrate voice‑first multimodal interactions (e.g., voice + touch, voice + gesture) on real devices. You’ll work closely with the team to turn ideas into working demos.
Model Integration & Tuning: Use existing speech tools (e.g., Whisper, Vosk, or similar) to integrate features like speech recognition or text‑to‑speech. Perform light tuning and testing to improve responsiveness and usability.
Multimodal Fusion Logic: Work with inputs such as audio, camera, and touch to implement straightforward interaction logic (e.g., combining voice commands with simple gestures or confirmations).
Collaborative Feature Exploration: Collaborate with researchers and designers to prototype new interaction ideas (e.g., conversational interfaces, voice commands). Help translate concepts into testable demos.
Toolkit & Framework Usage: Use common tools and frameworks (e.g., PyTorch, TensorFlow). Learn how different tools are used for on‑device vs. cloud‑based solutions.
Code Quality & Experimentation: Write clean and readable code. Run small tests (e.g., latency or usability checks) and document what you learn.
The total target annual compensation for this position ranges from $58,000 to $104,000, depending on education, experience, and demonstrated expertise.
Job requirements
About the ideal candidate
Currently pursuing a Bachelor’s degree in Computer Science, HCI, or a related field, and enrolled in a co‑op program.
Solid understanding of data structures and algorithms, including the ability to reason about time/space complexity and choose the right structure for the task.
Strong coding skills in at least two of Python, C++, Java, C#, and Kotlin, with experience writing programs of several hundred lines independently.
Exposure to voice/speech or AI concepts (e.g., speech recognition, chatbots, or basic machine learning).
Familiarity with at least one ML framework (e.g., PyTorch, TensorFlow) and a basic model inference workflow (e.g., loading a model, running inference, post‑processing).
Experience with at least one platform (e.g., Windows, Android, or Linux) – comfortable debugging, using the command line, and managing dependencies.
Strong problem‑solving and troubleshooting skills – able to unblock themselves on straightforward issues before asking for help.
Genuine interest in building intelligent voice‑driven interfaces and exploring how speech integrates with other modalities.
Additional Information
Huawei Canada is committed to a fair, inclusive, and accessible recruitment process. If you require accommodation during any stage of the hiring process, please let us know and we will work with you to meet your needs.
Key information
Company name: Huawei Canada
Job title: Co-op Researcher – Multimodal & Voice Interaction (HCI)