
Commit 9deee86

Updated Readme and requirements.txt
1 parent f367014 commit 9deee86

File tree

4 files changed: +22, -7 lines


README.md (+16 lines)
@@ -17,3 +17,19 @@ Reconnect first allows the user to listen to a sound file that contains a senten
Reconnect uses Microsoft Azure’s Speech-to-Text function to convert the user’s speech input into text. By comparing this text against the sentence provided to the user, Reconnect is able to determine whether the user’s pronunciation is adequately correct. Azure’s Text-to-Speech function is then used to generate a separate speech output from the same sentence. These two .wav files are then processed by Reconnect.
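
For readers unfamiliar with the Azure Speech SDK, the sketch below shows roughly how such a round trip could be wired up in Python. It is a minimal illustration, not the project's actual code: the key, region, and file names are placeholder assumptions.

```python
# Minimal sketch of the Azure round trip (illustrative only, not Reconnect's code).
# SPEECH_KEY, SPEECH_REGION and the .wav paths are placeholder assumptions.
import azure.cognitiveservices.speech as speechsdk

SPEECH_KEY = "your-azure-speech-key"
SPEECH_REGION = "southeastasia"

speech_config = speechsdk.SpeechConfig(subscription=SPEECH_KEY, region=SPEECH_REGION)

def recognize_user_audio(wav_path):
    """Transcribe the user's recorded .wav file into text (Speech-to-Text)."""
    audio_config = speechsdk.audio.AudioConfig(filename=wav_path)
    recognizer = speechsdk.SpeechRecognizer(speech_config=speech_config, audio_config=audio_config)
    result = recognizer.recognize_once()
    return result.text if result.reason == speechsdk.ResultReason.RecognizedSpeech else ""

def synthesize_reference_audio(sentence, out_path):
    """Generate a reference pronunciation .wav of the same sentence (Text-to-Speech)."""
    audio_config = speechsdk.audio.AudioOutputConfig(filename=out_path)
    synthesizer = speechsdk.SpeechSynthesizer(speech_config=speech_config, audio_config=audio_config)
    synthesizer.speak_text_async(sentence).get()

if __name__ == "__main__":
    print("Recognized:", recognize_user_audio("user_attempt.wav"))
    synthesize_reference_audio("The quick brown fox jumps over the lazy dog.", "reference.wav")
```
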
Reconnect uses the SciPy library to convert the sound files into audio data chunks. By using our self-developed algorithms to process the audio data’s amplitude, frequency, and breaks, Reconnect is able to determine the relative speed of vowel enunciation, and the presence of unnaturally long or short breaks between words and sentences.
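
As a rough illustration of that chunking step (the helper name, paths, and the mono downmix are assumptions, not the project's code), a .wav file can be read with SciPy and reduced to 0.1-second amplitude chunks, matching the rate/10 chunk size visible in reconnect_app/get_breaks1.py:

```python
# Illustrative sketch: average absolute amplitude over 0.1-second chunks.
import numpy as np
from scipy.io import wavfile

def wav_to_amplitude_chunks(path, chunks_per_second=10):
    rate, data = wavfile.read(path)         # rate in Hz, samples as a NumPy array
    if data.ndim > 1:                       # two-channel input: average down to mono
        data = data.mean(axis=1)
    chunk_size = rate // chunks_per_second  # samples per 0.1-second chunk
    n_chunks = len(data) // chunk_size
    samples = np.abs(data[: n_chunks * chunk_size].astype(float))
    # Mean absolute amplitude per chunk; near-zero chunks indicate breaks/silence.
    return samples.reshape(n_chunks, chunk_size).mean(axis=1), chunk_size

if __name__ == "__main__":
    amplitudes, size = wav_to_amplitude_chunks("user_attempt.wav")
    print(f"{len(amplitudes)} chunks of {size} samples each")
```
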
Finally, Reconnect compiles all of this feedback before presenting it to the user. The user is then given the opportunity to try again. The user can also type a sentence that he or she hopes to practice, and Reconnect will generate a sound file to facilitate the same learning process described above.

# Challenges we ran into
Since the team consisted of a sophomore and two freshmen with less technical backgrounds, we ran into a lot of difficulties. This was the first time we had ever worked with APIs, and it was difficult to get everything working together. In the beginning, we did not think about the number of channels in the audio input. For the text comparison, we also had to mind the lengths of the expected text and the received text: the typed text had to be preprocessed so that it did not contain any special characters, while the expected text had to be preprocessed to omit unintended filler words like “oh”, “umm”, etc. (a rough sketch of this preprocessing follows the list below). Since none of us had much experience in web development, a significant challenge was getting audio input from the user:

- Microphone input
- Two-channel audio files and wave-comparison algorithms for them

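
One plausible way to handle the text preprocessing mentioned above is sketched here; the function name and filler-word list are illustrative assumptions rather than Reconnect's actual code.

```python
# Illustrative sketch: normalize the typed sentence and the comparison text
# before matching them word by word. The filler-word list is an assumption.
import re

FILLER_WORDS = {"oh", "umm", "um", "uh"}

def normalize(text, drop_fillers=False):
    text = re.sub(r"[^a-z\s]", "", text.lower())  # strip punctuation/special characters
    words = text.split()
    if drop_fillers:
        words = [w for w in words if w not in FILLER_WORDS]  # drop hesitation words
    return words

expected = normalize("Hello, world! How are you?")
received = normalize("umm hello world how are you", drop_fillers=True)
print(expected == received)  # True: lengths now match, word for word
```
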
# Accomplishments that we're proud of
Despite all the challenges, we are proud that we successfully built Reconnect, an interactive platform where people can practice speaking to reconnect with the world. Helping thousands of people worldwide transition from impaired hearing to speaking effectively is a great source of satisfaction for our team.

# What we learned
Being new to hackathons, we were initially unsure whether we should go forward with this idea because of its technical complexity. It was only the second hackathon for all of us and our first time ever using any sort of API. However, we decided to take up the challenge, and in the end it worked. In addition to learning more about programming, using APIs, and building a website, we learned to think big and apply our knowledge to make an impact on people’s lives.

# What's next for Reconnect
- consulting with medical professionals to get effective strategies for speech reconstruction
- expanding to different languages
- turning Reconnect into an actual learning platform with the ability to track progress and try different strategies

WriteUpForLocalHackDay.docx (-15.1 KB, binary file not shown)

reconnect_app/get_breaks1.py (+6, -7 lines)
@@ -110,13 +110,13 @@ def check_sensibility_of_breaks(self, speaker_breaks, correct_breaks):
             if (speaker_end - speaker_start) > 1.5:
                 self.result["long_breaks"].append(speaker_breaks[i])
             elif (speaker_break_time - correct_break_time) > 0.30:
-                self.result["long_breaks"].append((speaker_breaks[i][0] + self.input_sound_start_snip, speaker_breaks[i][1] + self.input_sound_start_snip))
+                self.result["long_breaks"].append((speaker_start + self.input_sound_start_snip, speaker_end + self.input_sound_start_snip))
             elif (correct_break_time - speaker_break_time) > 0.30:
-                self.result["short_breaks"].append((speaker_breaks[i][0] + self.input_sound_start_snip, speaker_breaks[i][1] + self.input_sound_start_snip))
-            if (speaker_start - correct_start) > (last_time_difference + 0.5):
-                self.result["long_pronunciation"].append((speaker_breaks[i-1][1] + self.input_sound_start_snip, speaker_breaks[i][0] + self.input_sound_start_snip))
-            elif (correct_start - speaker_start) > (last_time_difference + 0.5):
-                self.result["short_pronunciation"].append((speaker_breaks[i-1][1] + self.input_sound_start_snip, speaker_breaks[i][0] + self.input_sound_start_snip))
+                self.result["short_breaks"].append((speaker_start + self.input_sound_start_snip, speaker_end + self.input_sound_start_snip))
+            if (speaker_start - correct_start) > (last_time_difference + 0.5) and i > 0:
+                self.result["long_pronunciation"].append((speaker_breaks[i-1][1] + self.input_sound_start_snip, speaker_start + self.input_sound_start_snip))
+            elif (correct_start - speaker_start) > (last_time_difference + 0.5) and i > 0:
+                self.result["short_pronunciation"].append((speaker_breaks[i-1][1] + self.input_sound_start_snip, speaker_start + self.input_sound_start_snip))
             last_time_difference = abs(correct_end - speaker_end)

     def remove_audio_wave_silence(self, audio_data, rate, min=None):
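
(Reading the hunk above: the rewritten lines reuse the speaker_start and speaker_end values already unpacked from speaker_breaks[i], and the added `and i > 0` guards appear intended to keep the pronunciation checks from indexing speaker_breaks[i-1] on the first iteration.)
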
@@ -165,7 +165,6 @@ def convert_audio_data_to_chunk_audio_data(self, audio_data, rate):
             chunk_audio_data[i] = chunk_audio_data[i] / (int(rate) / 10)
         return chunk_audio_data, int(rate/10)

-
 if __name__ == "__main__":
     # web_file="C:\Users\Samuel\PycharmProjects\speech_analysis\wave_comparison"
     #

~$iteUpForLocalHackDay.docx (-162 Bytes, binary file not shown)
