Conclusion & Future Work

Conclusion

WaldoGen demonstrates a full computer vision pipeline for turning an ordinary image into a Where's Waldo game. The implementation combines modern pretrained models with rule based reasoning. Overall, the project creates these games by combining detection, segmentation, depth estimation, compositing, stylization and blending to create Where's Waldo scenes from arbitrary pictures. Our results showcase the benefits and detriments to our approach. The pipeline itself is flexible due to each stage being built separately with debug info generated along the way. The quality of the output depends a lot on how good the pretrained models are as well as our heuristics. In edge case scenes with bad perspective, unusual objects, or other outlier features Waldo may not be properly placed and hidden.

Future work

Future work could improve all aspects of the pipeline. Some ideas are: