This paper presents an illustrative cross-device evaluation of spatial audio reproduction in smart glasses and XR headsets using binaural in-ear recordings and external sound-level measurements on four anonymized commercial devices. The evaluation is organized around baseline playback behavior, cue fidelity, sound leakage, and robustness to wearing variability, with metrics derived from broadband-noise and swept-sine measurements. The results reveal distinct device behaviors, including differences in channel balance, interchannel signal behavior, preservation of HRTF-encoded binaural cues, perturbation of real-world acoustic cues, external sound radiation, and sensitivity to reseating. Rather than establishing a product ranking, this study demonstrates how the benchmark supports structured cross-device interpretation of wearable XR spatial audio systems.