Efficient LLL-based lattice reduction for MIMO detection: From algorithms to implementations