I have a table of hospital names. I want to determine what single and tuples in each hospital's name is unique (single occurence in the table) identification purposes. Ideally I want to exclude "fluff" words like "of" from the query where those words have a frequency that exceeds "X" appearances. I would like a query that gives me this result from the table:
RH 1
Dedman 1
RH Dedman 1
Mother Francis 1
Francis 1
Death 1
Vincennes 1
Regional 1
Mercy 1
Tulsa 1
JPS 1
Where the table has these records: "Mother Francis Memorial Hospital and Clinic of Tulsa" and "Angel of Death Health and Medical Center" and "Mother Angel of Mercy memorial Hospital", "Vincennes Medical Center" and "JPS Health Clinic", "RH Dedman Memorial Hospital".
In this example, let's say "X" is 3, therefore "Hospital", medical", "medical center" , "of" , "and", memorial" are not a part of determining what is a one-apperance tuple (obviously would not be unique one word).