I didn't really get into this story. I was waiting for it to start, but it seemed to never get past navelgazing and complaining about whether his choice is the right one or not. I'm with those who thought he was being a creeper, very much "put the woman on the pedestal" but the ultimate case of it because he can keep her as a bridesicle to keep her on that pedestal. All his talk about her not being able to handle the future was self-serving BS that didn't even fit in the image that he painted of her. But all that was well again, but nothing really happened in the body of the story apart from the navelgazing.
Regarding the sound effects, I thought they detracted from the story. "Basically, ice clinking is cool, but mouth sounds are gross?" I think there's some truth to that. In real life I don't hear mouth noises, but I also don't routinely listen with my ear two inches away from people's mouths. Microphones pick up all kinds of crap that we filter out in our everyday lives but something about the recording or listening process makes some of them suddenly much harder to ignore.
Also, at least for me while listening with one headphone while driving, maybe the sound effects were constant throughout but I just heard the occasional one, maybe the louder ones that could rise above the road noise. Those were quiet, too quiet, to the point that every time I thought "wait, was that slurping or just smacking his laps or what exactly was that?" and then I missed a couple sentences of the words after that. And with not being particularly engaged in the story it only made me wait and try to predict where the next sound would be because that was more interesting than the story itself. I think that if the glass noises and stuff were important to establish the ambience (which I don't think they were) then I don't think that the reader actually manipulating a glass was the way to go because the volume balance made it all the more distracting. If it were important I think that a pre-recorded kind of sound effect that could be volume balanced separately would be better so that if they need to be there they're at a volume that isn't just on the lower edge of my volume listening range.