SSL-Wearables (HARNet)

Self-supervised CNN pretrained on 700,000 person-days of UK Biobank accelerometer data for human activity recognition across devices and cohorts.

Released: March 2024

SSL-Wearables, whose pretrained networks are distributed as HARNet, is a family of self-supervised deep learning models for human activity recognition (HAR) from wrist-worn accelerometer data. Developed by the Oxford Wearables Group (OxWearables) at the University of Oxford and published in npj Digital Medicine in 2024, the work addresses a persistent bottleneck in digital health: labeled accelerometer data is scarce and expensive to collect, while raw sensor data from population-scale studies is abundant but unlabeled. By pretraining on enormous quantities of unlabeled signal, the authors produce feature extractors that transfer to small labeled datasets and substantially outperform models trained from scratch.

The central contribution is leveraging the UK Biobank accelerometer study — approximately 700,000 person-days of wrist-worn triaxial accelerometer recordings — as an unlabeled corpus for self-supervised pretraining. This is among the largest self-supervised efforts applied to wearable time series, and it demonstrates that the representation-learning paradigm that transformed vision and language can be applied to consumer-grade movement sensors.

Critically, the resulting models generalize beyond their pretraining cohort, maintaining strong performance across different sensor devices, study populations, and free-living versus laboratory environments — addressing a long-standing concern that HAR models overfit to the specific hardware and protocol used to collect their training data.

Key Features

Massive self-supervised pretraining: The feature extractor is pretrained on roughly 700,000 person-days of unlabeled UK Biobank accelerometer data, removing the dependence on costly manual activity annotation.
Multi-task pretext objectives: Pretraining combines three self-supervised tasks — arrow-of-time (temporal direction) prediction, channel permutation detection, and time-warping detection — which jointly teach the network temporally and structurally meaningful motion representations.
Three time-scale variants: Released as HARNet5, HARNet10, and HARNet30 for 5-, 10-, and 30-second windows (150, 300, and 900 samples at 30 Hz), letting downstream users match the model to their labeling granularity.
Cross-device and cross-cohort generalization: Models transfer across external datasets, different participant populations, living environments, and sensor hardware rather than being tied to UK Biobank's specific device.
Drop-in PyTorch Hub access: Pretrained weights load in one line via torch.hub.load('OxWearables/ssl-wearables', 'harnet5', pretrained=True), providing a ready feature extractor with an untrained classification head for fine-tuning.

Technical Details

The architecture is a ResNet-style one-dimensional convolutional neural network that ingests 3-channel (x, y, z) accelerometer signals sampled at 30 Hz. The model is split into a pretrained convolutional feature extractor and a task-specific classifier head that users fine-tune on their own labeled data. Pretraining uses the multi-task self-supervised objectives described above on the UK Biobank corpus; downstream adaptation requires only the modest labeled set for the target task. Evaluated on eight HAR benchmark datasets spanning different devices and populations, the self-supervised models consistently outperform strong supervised baselines, with reported F1 relative improvements of 2.5–130.9% (median 24.4%). Gains are largest on small benchmarks such as the ADL dataset and smallest on already data-rich benchmarks like Capture-24.

Applications

SSL-Wearables targets researchers in physical activity epidemiology, digital health, and behavioral science who need to classify activities (e.g., sleep, sedentary behavior, walking, exercise) from wrist accelerometers but have limited labeled data. Because the pretrained feature extractor transfers across devices and cohorts, groups running clinical trials, cohort studies, or consumer-wearable analyses can fine-tune a strong model on a few hundred labeled examples rather than collecting and annotating data at scale. The one-line PyTorch Hub interface makes it straightforward to integrate into existing accelerometer-processing pipelines.

Impact

The work established self-supervised learning as a practical paradigm for wearable sensor data, showing that population-scale unlabeled cohorts like UK Biobank can be mined to build broadly transferable activity-recognition models. The openly released HARNet weights have become a widely used starting point for accelerometer-based HAR, lowering the barrier for labeled-data-poor studies and informing downstream digital biomarker research. A key limitation is licensing: the code and weights are released for academic use only, with commercial use requiring a license from Oxford University Innovation, and the models are specialized to wrist-worn triaxial accelerometry rather than arbitrary biosignals.

Citation

Self-supervised learning for human activity recognition using 700,000 person-days of wearable data

Yuan, H., et al. (2022) Self-supervised learning for human activity recognition using 700,000 person-days of wearable data. npj Digital Medicine.

DOI: 10.1038/s41746-024-01062-3

Recent citations

Papers that recently cited this model.

Toward Wearable Sensor-Based Human Activity Recognition: A Survey
Hailin Zou, Zijie Chen, Yuanyuan Pan, et al.
IEEE Internet of Things Journal · Jul 2026
0
Association between wrist-worn actigraphy and the MDS-UPDRS Parkinson’s disease rating scale through machine learning: an exploratory study
Gent Ymeri, S. Caramaschi, Alban Haton, et al.
Frontiers in Digital Health · Jul 2026
0
Learning When to Intervene on Habitual Behaviors: A Case Study in Oral Health Care
Bhanuteja Gullapalli, Vivek Shetty, Anna L. Trella, et al.
Jul 2026
0

Top citations

The most-cited papers that cite this model.

PaPaGei: Open Foundation Models for Optical Physiological Signals
Arvind Pillai, Dimitris Spathis, F. Kawsar, et al.
International Conference on Learning Representations · Oct 2024
72
Scaling Wearable Foundation Models
Girish Narayanswamy, Xin Liu, Kumar Ayush, et al.
International Conference on Learning Representations · Oct 2024
58
CAPTURE-24: A large dataset of wrist-worn activity tracker data collected in the wild for human activity recognition
Shing Chan, Yuan Hang, Catherine Tong, et al.
Scientific Data · Feb 2024
52
The first step is the hardest: Pitfalls of Representing and Tokenizing Temporal Data for Large Language Models
Dimitris Spathis, F. Kawsar
J. Am. Medical Informatics Assoc. · Sep 2023
51
SensorLM: Learning the Language of Wearable Sensors
Yuwei Zhang, Kumar Ayush, Siyuan Qiao, et al.
arXiv.org · Jun 2025
47

Citations

Total Citations210

Influential30

References78

GitHub

Stars162

Forks46

Open Issues6

Contributors6

Last Push1y ago

LanguageJupyter Notebook

Fields of citing research

Computer Science89%
Engineering63%
Medicine49%
Environmental Science7%
Psychology3%
Physics3%
Biology2%
Education1%

Share of papers citing this model.

Openness

bio.rodeo opennessClosed · low usability and reproducibility

28Closed

Usability — can I run it?27

Reproducibility — can I retrain it?17

Model Openness Framework

Unclassified

Restrictive license on core components

Resources

GitHub Repository Research Paper

Key Features

Massive self-supervised pretraining: The feature extractor is pretrained on roughly 700,000 person-days of unlabeled UK Biobank accelerometer data, removing the dependence on costly manual activity annotation.

Multi-task pretext objectives: Pretraining combines three self-supervised tasks — arrow-of-time (temporal direction) prediction, channel permutation detection, and time-warping detection — which jointly teach the network temporally and structurally meaningful motion representations.

Three time-scale variants: Released as HARNet5, HARNet10, and HARNet30 for 5-, 10-, and 30-second windows (150, 300, and 900 samples at 30 Hz), letting downstream users match the model to their labeling granularity.

Cross-device and cross-cohort generalization: Models transfer across external datasets, different participant populations, living environments, and sensor hardware rather than being tied to UK Biobank's specific device.

Drop-in PyTorch Hub access: Pretrained weights load in one line via torch.hub.load('OxWearables/ssl-wearables', 'harnet5', pretrained=True), providing a ready feature extractor with an untrained classification head for fine-tuning.

Technical Details

Applications

Impact

Recent citations

Papers that recently cited this model.

Toward Wearable Sensor-Based Human Activity Recognition: A Survey

Hailin Zou, Zijie Chen, Yuanyuan Pan, et al.

IEEE Internet of Things Journal · Jul 2026

Association between wrist-worn actigraphy and the MDS-UPDRS Parkinson’s disease rating scale through machine learning: an exploratory study

Gent Ymeri, S. Caramaschi, Alban Haton, et al.

Frontiers in Digital Health · Jul 2026

Learning When to Intervene on Habitual Behaviors: A Case Study in Oral Health Care

Bhanuteja Gullapalli, Vivek Shetty, Anna L. Trella, et al.

Jul 2026

SSL-Wearables (HARNet)

#Key Features

#Technical Details

#Applications

#Impact

Citation

Self-supervised learning for human activity recognition using 700,000 person-days of wearable data

Recent citations

Learning When to Intervene on Habitual Behaviors: A Case Study in Oral Health Care

Top citations

Related models

Citations

GitHub

Fields of citing research

Openness

Tags

Resources

SSL-Wearables (HARNet)

#Key Features

#Technical Details

#Applications

#Impact

Citation

Self-supervised learning for human activity recognition using 700,000 person-days of wearable data

Recent citations

Learning When to Intervene on Habitual Behaviors: A Case Study in Oral Health Care

Top citations

Related models

Citations

GitHub

Fields of citing research

Openness

Tags

Resources

Key Features

Technical Details

Applications

Impact

Key Features

Technical Details

Applications

Impact