Surely it would operate much as the various "Spoken Wikipedia" projects do currently? Though, on reading [[m:Video policy]], I note there are some technical limitations; the 2 MB file size cap would probably come into play for a video of any real length.
I think Commons supports larger video files.