[1,2], [1,1], [1000, 1000], [500, 500]] kmeans = KMeans.new(data, :centroids => 2) kmeans.inspect # Use kmeans.view to get hold of the un-‐inspected array => [[3, 4], [0, 1, 2]]
four miles from Westwood, according to the U.S. Geological Survey. The temblor occurred at 7:23 a.m. Pacific time at a depth of 4.3 miles. A magnitude 4.4 earthquake was reported at 6.25 a.m. and was felt over a large swath of Southern California. According to the USGS, the epicenter of the aftershock was five miles from Beverly Hills, six miles from Santa Monica and six miles from West Hollywood. In the last 10 days, there has been one earthquake of magnitude 3.0 or greater centered nearby. This information comes from the USGS Earthquake Notification Service and this post was created by an algorithm written by the author. http://lat.ms/1lTIGqa
'Uninteresting' b.train_interesting "here are some good words. I hope you love them" b.train_uninteresting "here are some bad words, I hate you" b.classify "I hate bad words and you" # returns 'Uninteresting'
strings = [ ["This text deals with dogs. Dogs.", :dog], ["This text involves dogs too. Dogs! ", :dog], ["This text revolves around cats. Cats.", :cat], ["This text also involves cats. Cats!", :cat], ["This text involves birds. Birds.",:bird ]] strings.each {|x| lsi.add_item x.first, x.last} lsi.search("dog", 3) # returns => ["This text deals with dogs. Dogs.", "This text involves dogs too. Dogs! ", # "This text also involves cats. Cats!"] lsi.find_related(strings[2], 2) # returns => ["This text revolves around cats. Cats.", "This text also involves cats. Cats!"] lsi.classify "This text is also about dogs!" # returns => :dog
"likes:apples", "likes:red"] c = ["likes:apples", "likes:red"] # Determines how similar a pair of sets are Jaccard.coefficient(a, b) #=> 0.25 Jaccard.coefficient(a, c) #=> 0.0 Jaccard.coefficient(b, c) #=> 0.6666666666666666 # According to the input data, b and c have the most similar likes.