Why the "Bing Map" and "ContextCapture 3D Map" are different?

Hello.

Why the "Bing Map" and "ContextCapture 3D Map" are different?

i did run the ContextCapture using "WGS 84 / UTM zone 52N (EPSG:32652) + EGM96 geoid height (EPSG:5773)".

My iModel location is "Tokyo".
Couldn't Tokyo use the "WGS 84 / UTM zone 52N (EPSG:32652) + EGM96 geoid height (EPSG:5773)"?