This is used when indexing Content and matching Content fields.
Actual format of the returned value depends on the search engine
implementation, meaning engines should override common implementation
as needed, but the same input should be handled across engines.