For best results, use 20-30 training images. Here's why:
Upload ZIP file (under 4MB) with images. Training takes ~3min for 2.2MB ZIP.
Optional caption/mask files supported