Skip to Content
Find More Like This
Return to Search

Method and system of filtering and recommending documents

United States Patent

February 9, 2016
View the Complete Patent at the US Patent & Trademark Office
Oak Ridge National Laboratory - Visit the Partnerships Directorate Website
Disclosed is a method and system for discovering documents using a computer and providing a small set of the most relevant documents to the attention of a human observer. Using the method, the computer obtains a seed document from the user and generates a seed document vector using term frequency-inverse corpus frequency weighting. A keyword index for a plurality of source documents can be compared with the weighted terms of the seed document vector. The comparison is then filtered to reduce the number of documents, which define an initial subset of the source documents. Initial subset vectors are generated and compared to the seed document vector to obtain a similarity value for each comparison. Based on the similarity value, the method then recommends one or more of the source documents.
Patton; Robert M. (Knoxville, TN), Potok; Thomas E. (Oak Ridge, TN)
UT-Battelle LLC (Oak Ridge, TN)
13/ 920,803
June 18, 2013
STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH AND DEVELOPMENT This invention was made with government support under Contract No. DE-AC05-00OR22725 awarded by the U.S. Department of Energy. The government has certain rights in the invention.