But no matter how much you invest in speakers ( not only the speakers, the room acoustics need to be treated too and if you're unlucky, that too can cost a lot) some details are only possible to detect in headphones/ IEMs.
I've a very expensive pair of Barefoot monitors mostly for work (and would never suggest such expensive set up for casual listeners) and miss a lot of details in normal volume even in a semi-treated room.
They are exceptional for panning etc...don't get me wrong, they are exceptional in detail retrieval, but it's my ageing ears that can't pick up subtle things in a mix (lot easier in headphones) from normal distance.