Deep learning in practice: ViT image classification and DQN multi-agent transport

ViT image classification (Kaggle Top 10%)

Fine-tuned ViT/DeiT models for image classification. The key techniques were layer-wise learning rates (small steps for the lower layers, larger steps for the top), label smoothing, and a RandAugment/Mixup/CutMix augmentation combination. Validation accuracy reached about 98%, and the submission ranked in the top 10% of the competition.
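The layer-wise learning rates and label smoothing mentioned above can be sketched in plain Python. This is a minimal illustration rather than the project's actual training code: the function names, the 12-layer depth, the top learning rate of 1e-3, the decay factor of 0.75, and the smoothing value of 0.1 are all assumptions chosen for the example.

```python
def layerwise_lrs(num_layers, top_lr=1e-3, decay=0.75):
    """Return one learning rate per layer; index 0 is the lowest layer.

    The top layer gets top_lr, and each layer below it gets the rate of
    the layer above multiplied by decay (hypothetical values).
    """
    return [top_lr * decay ** (num_layers - 1 - i) for i in range(num_layers)]


def smooth_labels(target, num_classes, eps=0.1):
    """Return a label-smoothed target distribution as a list of floats.

    Every class receives eps / num_classes, and the true class receives
    an extra 1 - eps, so the entries still sum to 1.
    """
    off_value = eps / num_classes
    return [
        off_value + (1.0 - eps if k == target else 0.0)
        for k in range(num_classes)
    ]


# Assumed depth of 12 transformer blocks, for illustration only.
lrs = layerwise_lrs(num_layers=12)
soft_target = smooth_labels(target=2, num_classes=5)
```

In a real training loop, each rate in `lrs` would typically be attached to the parameters of the matching layer through the optimizer's per-parameter-group options, and the smoothed target would replace the one-hot label in the cross-entropy loss.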

DQN multi-agent transport

Four agents perform round-trip transport on a 5×5 grid. A DQN whose network is shared across the agents lowers training cost, and a yield-priority mechanism resolves path conflicts, giving a 95% success rate and cutting average steps by 20%. The "coordinate and yield" design is the same class of problem as conflict arbitration in today's multi-agent systems.
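One way to picture a yield-priority rule is the sketch below. It is not the project's implementation: the function `resolve_moves`, the priority ordering (lower number moves first), and the conservative rule that no agent may enter a cell that was occupied at the start of the step are all assumptions made for illustration.

```python
def resolve_moves(positions, proposals, priority):
    """Resolve one simultaneous step for all agents on the grid.

    positions: agent id -> current (row, col) cell
    proposals: agent id -> cell the agent's policy wants to move to
    priority:  agent id -> rank, where a lower rank moves first (assumed rule)

    An agent yields (stays in place) if its target was occupied at the
    start of the step or has already been granted to a higher-priority agent.
    """
    occupied = set(positions.values())  # cells taken at the start of the step
    claimed = set()                     # targets already granted this step
    result = {}
    for agent in sorted(positions, key=lambda a: priority[a]):
        current = positions[agent]
        target = proposals[agent]
        if target == current or target in occupied or target in claimed:
            result[agent] = current     # yield: wait one step
        else:
            result[agent] = target
            claimed.add(target)
    return result


# Hypothetical situation: agents 0 and 1 both want cell (0, 1).
positions = {0: (0, 0), 1: (0, 2), 2: (4, 4), 3: (2, 2)}
proposals = {0: (0, 1), 1: (0, 1), 2: (4, 3), 3: (2, 2)}
priority = {0: 0, 1: 1, 2: 2, 3: 3}
next_positions = resolve_moves(positions, proposals, priority)
```

Here agent 0 takes the contested cell and agent 1 waits in place. Because moves into currently occupied cells are refused, two agents can never swap places or collide within a step, at the cost of occasionally waiting longer than strictly necessary.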
