deanhu0822/Curie

:.Curie.:

The basic process of the program is as follows:

  1. The user enters address text.
  2. The user selects an address from the autocomplete list.
  3. The address is converted to latitude/longitude, which is used to fetch Google Street View static images.
  4. The images are analyzed by Google Cloud Vision label detection, which returns labels for the features found in each image and a score for each label.
  5. The labels and scores are parsed and compiled into a single string.
  6. The string is passed to GPT, which returns a description of the scenery of the area based on the labels and scores.
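The step where labels and scores are compiled into a single string could look roughly like the sketch below. It is not taken from the project's code. It assumes each label is a dict with `description` and `score` keys, as in the Cloud Vision REST response's `labelAnnotations` entries. The function names, the `min_score` cutoff, and the prompt wording are all hypothetical.

```python
from typing import Iterable, Mapping


def compile_labels(annotations: Iterable[Mapping[str, object]],
                   min_score: float = 0.5) -> str:
    """Merge label annotations from several images into one summary string.

    Assumption: each annotation has "description" and "score" keys.
    Labels seen in more than one image keep their highest score.
    """
    best: dict[str, float] = {}
    for ann in annotations:
        name = str(ann["description"]).strip().lower()
        score = float(ann["score"])
        # Drop low-confidence labels (hypothetical cutoff) and keep the best score.
        if score >= min_score and score > best.get(name, -1.0):
            best[name] = score
    # Highest score first; ties are broken alphabetically for stable output.
    ranked = sorted(best.items(), key=lambda item: (-item[1], item[0]))
    return ", ".join(f"{name} ({score:.2f})" for name, score in ranked)


def build_prompt(label_summary: str) -> str:
    """Wrap the summary in an instruction for the language model (wording is illustrative)."""
    return (
        "Describe the scenery of this location for a visually impaired listener, "
        "based on these image labels and confidence scores: " + label_summary
    )


if __name__ == "__main__":
    sample = [
        {"description": "Tree", "score": 0.97},
        {"description": "Road", "score": 0.91},
        {"description": "tree", "score": 0.80},
        {"description": "Car", "score": 0.30},
    ]
    print(build_prompt(compile_labels(sample)))
```

Keeping only the highest score per label stops the prompt from repeating a feature that shows up in several images of the same location.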

This program is meant to be paired with an accessibility feature such as VoiceOver for Mac. VoiceOver reads on-screen text aloud according to where the user places focus. In our program, the main purpose of a screen reader such as VoiceOver is to read aloud the description generated by GPT. This brings a partial experience of Google Street View to visually impaired users.

Instructions for program:

  1. Type an address into the address search bar, or use the mic feature for speech-to-text input.
  2. Select an address from the autocomplete suggestions.
  3. Read the output from GPT printed in the text box.

APIs used

  1. Google Street View Static API
  2. Google Places API
  3. Google Cloud Vision API
  4. OpenAI Completions API
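
As a rough illustration of how the first API in this list might be called, the sketch below builds Street View Static API request URLs for one coordinate. The endpoint and the `size`, `location`, `heading`, and `key` parameters come from the public Street View Static API. The function name, the choice of four compass headings, and the image size are assumptions, not details confirmed by this project.

```python
from urllib.parse import urlencode

STREET_VIEW_URL = "https://maps.googleapis.com/maps/api/streetview"


def street_view_urls(lat: float, lng: float, api_key: str,
                     headings: tuple[int, ...] = (0, 90, 180, 270),
                     size: str = "600x400") -> list[str]:
    """Return one image URL per compass heading for the given coordinate.

    Assumption: the program looks in several directions; the source only
    says that "images" are fetched.
    """
    urls = []
    for heading in headings:
        params = {
            "size": size,                  # image dimensions in pixels
            "location": f"{lat},{lng}",    # latitude,longitude pair
            "heading": heading,            # camera direction in degrees
            "key": api_key,                # your own Google Maps Platform key
        }
        urls.append(f"{STREET_VIEW_URL}?{urlencode(params)}")
    return urls


if __name__ == "__main__":
    for url in street_view_urls(40.7128, -74.006, "YOUR_API_KEY"):
        print(url)
```

Each URL can then be downloaded or passed to the image analysis step. The API key is a placeholder and must be replaced with a real one.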