I'm looking for a way to approximate the reading level of documents for a search.
I figure, a good start may be sentence length, vocabulary, and sentence structure.
Do you have any functions in place to assist in any of these tasks?
There isn't a "reading level" function in Texis, however you can probably get some approximations using rex. You can use our sentence expression:
[^\digit\upper][.?!][\space'"]
to gain an approximate count of sentences, and thus sentence length. You can use rex to find long words, which is a good indication of advanced vocabulary. You can also use rex to find commas, semi-colons etc to estimate the complexity of the sentence structure.