This is certainly an interesting idea, but I'm not sure it has a place in either Wikipedia or Wikidata unless we're talking about the clips being notable quotes.
For Wikipedia, if it's just a voice sample - as opposed to a notable quote - the community is going to view it as cruft and remove it from articles, as the majority of users will find a contextless sound clip to be of little encyclopedic value.
For Wikidata, why would we link to an audio sample if it's of no valueto sister projects and no different from other voice samples (except for the license).
I like the idea, don't get me wrong. I just think that the broader community is not going to see the utility in the samples.
Sven