1 - I don't know if this effect will work at all, because people read at different speeds.
2 - Can you actually concentrate with the words and music at once?
3 - Is this an absolutely pointless combination?
4 - Can I even DESCRIBE places in detail - I mean, I have to focus on feelings and expression. Sentences like "Bodies in motion moving greeting drinking swinging breathing". Most of the time I'm completely ignoring grammar rules - is this acceptable?
Okay, first off I think you have a interesting idea. Although I can see how you are running into some problems. As for your first problem I really can't help you there because, yeah of course people read at different speeds! But you are basically going to have to just bet on them keeping up to the pace of the song.
Secondly, if people can drive and put on make-up, text, curl their hair - not that any of those things are good ideas - than I am pretty sure that they could concentrate on music and reading.
Thirdly, I don't think it's pointless! You have a cool idea so how could it be pointless? People listen to music and read at once all the time, so by putting the music and the book in harmony you get a clear picture of emotion and action behind the story the words and the music tell.
As for your last question I really don't know about describing the places. I guess that is up for you to decide as the writer. Also, I don't know what you have written so I really can't say. When it comes to grammar rules I am pretty sure it's acceptable, but I suppose I could be wrong.
Anyway, I hope this helps! Good luck!