Feature #3607: Implementing AI into Hope project - AAC - EXPO projects

Feature #3607

Implementing AI into Hope project

Added by Kasthuri G R 8 months ago. Updated 6 months ago.

Status:

Assigned

Start date:

03/04/2024

Priority:

Normal

Due date:

04/30/2024

Assignee:

Kasthuri G R

% Done:

Category:

Spent time:

Target version:

Description

Take input words from Hope, generate sentences using large language models (LLMs) with basic commonly used words

UI_1.png (199 KB) Kasthuri G R, 03/20/2024 06:41 PM

UI_2.png (247 KB) Kasthuri G R, 03/20/2024 06:41 PM

ThankYou.png (68 KB) Kasthuri G R, 03/21/2024 06:38 AM

Classroom.png (117 KB) Kasthuri G R, 03/21/2024 06:38 AM

Nice_to_meet.png (90.3 KB) Kasthuri G R, 03/21/2024 06:39 AM

Sorry.png (67.7 KB) Kasthuri G R, 03/21/2024 06:39 AM

2.keywords-dog-play-ball.png (534 KB) Kasthuri G R, 05/30/2024 08:36 PM

1.keywords-dog-play-ball-happy.png (531 KB) Kasthuri G R, 05/30/2024 08:36 PM

3.keywords-I-sleepy-hungry.png (500 KB) Kasthuri G R, 05/30/2024 08:36 PM

4.keywords-I-pain-sad.png (507 KB) Kasthuri G R, 05/30/2024 08:36 PM

5.keywords-sorry-helpme.png (498 KB) Kasthuri G R, 05/30/2024 08:36 PM

6.keywords-we-sorry.png (539 KB) Kasthuri G R, 05/30/2024 08:36 PM

History

#1 Updated by Kasthuri G R 8 months ago

Google has released an open AI model Gemma(https://blog.google/technology/developers/gemma-open-models/). I am able to generate sentences for the given input words.

I have provided the GSheet link below with the input words and corresponding AI results generated by Gemma model

[[https://docs.google.com/spreadsheets/d/1CluoU2D38b6nuRMt4AJDTpU6_GFoB0e3VcQOo0_QJ0Y/edit?usp=sharing]]

This is open source model and it gives expected results. We will have to design the prompts to get better results based on the user needs. I will set a deadline based on the list of features we deciding

#2 Updated by Kasthuri G R 8 months ago

To get inputs from special educators, made a sample UI in python that user can upload image and it will generate sentences for the given image. I have implemented the functionality. Yet to make the UI better with background color and alignment.

#3 Updated by Kasthuri G R 8 months ago

File UI_1.png added
File UI_2.png added

Made a sample UI that allows to give input image and generate sentence. The model output varies like below. I am trying to figure out the method to filter the output.

For eg: Given Prompt "construct 5 basic proper sentences about {input word} and its habits for primary school students"

AI output

Step 1/5
1. Parrots are beautiful birds that come in many different colors and sizes.

Step 2/5
2. Parrots are very smart and can learn many tricks and even talk.

Step 3/5
3. Parrots are social birds and love to be around other parrots and people.

Step 4/5
4. Parrots need a lot of attention and exercise to stay healthy.

#4 Updated by Anil Prabhakar 8 months ago

Tracker changed from Bug to Feature
Project changed from CREATE to AAC
Due date set to 04/30/2024
Status changed from New to Assigned

There should be a Due Date for every task and an updated %Done. Cannot work without any deadlines.

Do we have a creative common license to use these images?
How are you imagining the use case, is this set up by a teacher, and the child clicks on a button, or is the child supposed to upload an image?
This is not picture to speech (think AI into HOPE...as the subject of this Issue). I am expecting a communication tool, where a user selects 1 or more images from the KAVI-PTS library, AI generates a few possible sentences, and the child picks one of them.

#5 Updated by Kasthuri G R 8 months ago

File ThankYou.png added
File Classroom.png added
File Nice_to_meet.png added
File Sorry.png added

Do we have a creative common license to use these images?
I will use the KAVI Library images

How are you imagining the use case, is this set up by a teacher, and the child clicks on a button, or is the child supposed to upload an image?
This is a sample UI we decided to create and show it to special educators in Mira school to get their ideas on the requirement. So we can plan the development in both Hope and Kavi-PTS based on their feedback

The sample use cases I have considered are below

1. AI to generate Sentences with the "Thank You, Sorry, Nice to meet you" which children can use. These are the pictures under Quick talk in the Kavi Library
2. Common sentences used by the children in Classroom. Class room is also the picture from our image Library

The respective AI responses for the above inputs are uploaded.

#6 Updated by Anil Prabhakar 8 months ago

Use the > to quote, otherwise your reply is mixed with my comment.
Sentences are quite ambiguous and will become frustrating. We may need more than one image to create a sentence. Let us try with 2 images.

#7 Updated by Kasthuri G R 6 months ago

When tried to run the python code where it takes input as Image(one) and generate text. It didn't run as huggingface has updated the libraries and the code need to updated to run the model without error. I have fixed the library issue and made the code functional.

I will modify the code to accept two images as inputs to generate text tomorrow

#8 Updated by Kasthuri G R 6 months ago

File 1.keywords-dog-play-ball-happy.png added
File 2.keywords-dog-play-ball.png added
File 3.keywords-I-sleepy-hungry.png added
File 4.keywords-I-pain-sad.png added
File 5.keywords-sorry-helpme.png added
File 6.keywords-we-sorry.png added

I made a UI to display the images. Images can be select/deselect on clicking. Once selecting the images, Clicking on Generate sentences will display the possible sentences with the keywords(image names).

I am sharing the sample outputs of the sentences generated by gemma model.

Prompt used "Generate sentences suitable for children using the all the key words. Make the sentences short, simple, and conversing. ";

Need to try with different prompts and have to run different models to compare the results

#9 Updated by Anil Prabhakar 6 months ago

Think about your conversation with anyone. When you are trying to communicate with your son, will the prompt be "generate suitable sentence"?

Don't be a programmer, put more thought into how you will use GenAI to build an AAC solution.

Also available in: Atom PDF

AAC

Issues