It probably would. The other graphical elements have drop shadows as well.
It shouldn't have them, though. This way the user can use whatever album art size he or she likes.
Moving the text elements up or down to make them align better with album art doesn't require viewport conditionals in most cases. The nano, x5 and h100 could use this feature, yes, but they don't look bad without viewport conditionals, either.
There is not much room for optimizing, anyway. Most improvements require nudging elements one or two pixels up or down.