#AI #personal Project

SingSmart

OverviewBusiness ProblemDiscoveryDefineIdeationEvaluationFinal Design

Overview

This project was a speculative UX case study to explore AI/ML opportunities in consumer mobile applications, focusing on real-world applicability and user-centered AI system design.

This project was developed as part of the Human Factors in AI course on Coursera, focusing on applying user-centered design principles to AI-driven applications.

Detail

Role: UX Designer

Type of Project: Personal Project

Duration: 2 Weeks

Purpose: Online Certification Assignment

Disclaimer: This feature has already been launched by Certinia. All data and name included in the prototype are entirely fictional and intended solely for demonstration purposes. For more information and latest updates, please read Certinia official release notes.

Discovery

Business Problem

Problem statement:Singing beginner encounter difficulties in learning music theory and sing perfect pitch during their practice

Objective: Encourage music beginner to participant more music activities with more confident and allow them to learn music in a easy and efficient way through self learning 

Define

Persona

The target user of this application is for music beginner who have no or little music background, and they want to learn how to sing.

User Challenges

The above diagram demonstrate the user journey of learner learning to sing a new song. The user challenges and opportunities I identified in the user journey included:

1. High Manual Effort

Requires high manual effort in identifying sofa syllables and correct pitch for the new music. With using AI, it can help users to understand the melody of the new music effectively.

2. Inaccuracy

Also, beginner can easily make mistake and those mistakes will influence their confident in learning music, therefore a tools with AI to assess learners’ pitch, rhythm and vocal and give recommendation can speed up learner learning progress

Solution & Hypothesis

Regarding the research, I believe AI singing mentor application can help music beginner learn new music more efficiently by reducing 20% of learning time.

Ideation

User Story Map

Referencing previous task analysis, user journey for the application is designed. The AI occurs while the application generate sofa syllables from the images using image processor, also evaluating learner performance and giving suggestion for improvement. 

To ensure model transparency, the application uses interpretable model. The performance score is determined on pitch, tempo and vocal. The determination of tempo and pitch score are based on non AI technology like turner and BPM, and determination of the vocal score is based on AI. To train the data model,  singing audio from different level of singer are collected, and professional singing teacher are asked to score the performance. And the system will find the pattern and trend of the score by vocal characteristics. 

Besides of the end to end user journey, feedback loop and fallback UX flow are designed to accompany different situation and AI technologies constraints:

1. Uncertainty

Sometime the app may not give the right score and accurate sofa syllables, as there are chance that the chord and tempo are not included in the database. Regarding this scenarios, it will give an confidence score, so the users can determine whether they want to follow all the syllables presented. We also rely on user feedback to determine whether the generated sofa syllables have any error.

2. Environment

This app relies heavily on sound detection technology, making it sensitive to environmental noise, which can affect recognition accuracy. In addition to using a confidence score to assess result reliability, the system monitors for high ambient noise levels and the presence of secondary human voices, prompting a reminder to users to ensure optimal recording conditions.

Evaluation

Usability Test (Hypthetic)

Aim

Understand how easily users can complete singing training tasks and perception of AI generated score

Methodology

Task-based testing using a 'Wizard of Oz' prototype, where users are asked to complete the end to end learning journey to evaluate usability. Also, we will simulate situation where the app cannot provide accurate result to evaluate how users perceive the fallout scenarios. The research mix with both quantitive and qualitative data, where success rate is measured and user feedbacks are summarised.

Participants

Users with vary singing skill and age group will be equally represented in the group of paricipants.

Hypothesis

  1. Success rate of completing the learning journey will be over 90%.
  2. Users understand the reason behind fallout situation and solutions.
  3. 90% of users will find the information can help them improve their singing skills.

Final Design

Prototype

Regarding all consideration, here are the prototype of the application which demonstrate the user experience of learning new song. To maintain transparency, there is a disclaimer (by clicking the i icon) stated that sofa syllables are generated by AI. User can learn more about the data model and they can report any error through provided link.

Same with performance score, there is a disclaimer stated that the score evaluated by AI. User can learn more about the data model and they can report any error through provided link. Also, contributed factors to the performance score are clearly stated, and suggestion of improving the score is provided. 

Presentation

I also did a presentation for the project, please feel free to watch it:

Ethnic Consideration for AI

Privacy Consideration 

Concern: This app may collect users individual‘s acoustic fingerprint which is directly identifiable and may cause concern with cyber security.

Privacy Law: As the application is an online service and analyse user data, it will be regulated by the privacy laws of each country that the application provides service for, including GDPR. Although this application won’t receive any federal funding, the application may be utilised in public educational section, it will follow educational privacy law, such as it won’t disclose any user data to third party and providing consent during onboarding.

Bias

Representation bias: Data collection may introduce representation bias due to the limited diversity of volunteers and singing tutors in terms of gender, ethnicity, musical background, and vocal style. Tutors may favor certain vocal characteristics or genres, and if the training data lacks variety, specific vocal types could be underrepresented in the model.

Method to mitigate bias: 1. Recruit diverse singing tutor and audio sample. 2. Selective of audio will be re evaluated by human singing tutors

Fairness: The intended purpose of the application is only used for teaching users how to sing, therefore it should not cause any fairness issues. There will be a disclaimer in the consent to state the designed purpose of the model and warn users it may caused unfairness if it is used for other purposes, such as admission for music school, auditioning and more.

Transparency: Explaining the attributes that contribute to the score and insight how user can improve their performance

Accountability: Provide a way for users to report any bias and unfair use.

Reflection

As this is a self learning project, there is lack of time and resources to conduct proper research and usability testing in with prototype.

For more information, please check out the link below
Read More

Thank you

Feel free to check out my other works

Feature Design
Value Tracker
December 1, 2024
Read More
Feature Design
Guided Scoping
April 18, 2024
Read More
Feature Design
PSA - Calendar Component
July 1, 2023
Read More
Mixed Reality
So You Think You Can Make Art?
April 1, 2022
Read More