Introduction
Related work
Methods
Fine-tuning with DreamBooth
Inference with ControlNet
Experiments
Dataset and implementation details
Style | Denoising | CFG | SoftEdge | Tile |
---|---|---|---|---|
CholecT45 vid52 & vid56 | 0.45 | 4.5 | 0.5 | 0.3 |
CholecT45 vid25 & vid66 | 0.45 | 5.0 | 0.4 | 0.3 |
CholecT45 vid01 & vid49 | 0.5 | 5.0 | 0.55 | 0.3 |
Evaluation metrics
Method | Style | mIoU [%] \(\uparrow \) | FID \(\downarrow \) | KID \(\downarrow \) | \(\textrm{LPIPS}_{\textrm{VGG}} \uparrow \) |
---|---|---|---|---|---|
N/A | Raw simulation images | 24.73 | 305.00 | .3739 ±.0041 | .5820 |
[22] | Random | 45.28 | 110.92 | .1243 ±.0035 | .5834 |
[22] | Cholec80 | 42.21 | 67.13 | .0623 ±.0017 | .6407 |
Ours | CholecT45 vid52 & vid56 | 66.85 | 68.35 | .0658 ±.0015 | .6245 |
Ours | CholecT45 vid25 & vid66 | 69.76 | 63.07 | .0582 ±.0012 | .6262 |
Ours | CholecT45 vid01 & vid49 | 67.20 | 57.47 | .0513 ±.0011 | .6175 |
Ours | Mixed styles | 67.89 | 54.57 | .0473 ± .0011 | .6281 |
Style | No control | Only SoftEgde | Only tile | SoftEdge + Tile |
---|---|---|---|---|
CholecT45 vid52 & vid56 | 61.52 | 65.26 (\(+\) 6.1%) | 64.20 (\(+\)4.4%) | 66.85 (+8.7%) |
CholecT45 vid25 & vid66 | 63.35 | 67.16 (\(+\) 6.0%) | 68.01 (\(+\)7.4%) | 69.76 (+10.1%) |
CholecT45 vid01 & vid49 | 54.29 | 63.26 (\(+\) 16.5%) | 62.08 (\(+\)14.3%) | 67.20 (+23.8%) |