xo4da diverse, large-scale multi-modal multi-view video dataset and benchmark challenge centering around simultaneously-captured ego-centric and exo-centric video of skilled human activitiesa diverse, large-scale multi-modal multi-view video dataset and benchmark challenge centering around simultaneously-captured ego-centric and exo-centric video of skilled human activities