data-mining
Rapid miner: CSV with real numbers with commas instead of dots
I have a problem importing a CSV file with RapidMiner. Floating point values are written with commas instead of the separating dot between the integer and decimal values开发者_开发知识库.[详细]
2023-03-03 13:22 分类:问答Structured text and unstructured text
With respect to the data mining, what are the differences between structured text and unstructured text? What are the major considerations when c开发者_如何学运维hoosing/developing data mining approac[详细]
2023-03-03 00:37 分类:问答how to use Oracle integrated data mining functions to detect outliers present in the data set
I have installed oracle10g enterprise edition on my computer. I want to find the outliers from the dataset, how this can be achieved using the dbms_data_mining_transform package. I knew simple statist[详细]
2023-03-02 19:38 分类:问答Efficiently counting co-occurrences in a large dataset
Came across this interview programming test recently: You\'re given a list of top 50 favorite artists for 1000 users (from last.fm)[详细]
2023-03-02 19:32 分类:问答Is it possible to do HTML scraping , data mining through Python?
Can I gather intelligent data , HTML scraping using python? I have no knowledge of it , so I woul开发者_运维问答d like to get some idea.Look at the module scrapy:[详细]
2023-03-02 12:31 分类:问答Java text clustering library
Which of the data mining java libraries can d开发者_JS百科o text clusterization?Check this tututorial http://alias-i.com/lingpipe/demos/tutorial/cluster/read-me.html.[详细]
2023-03-02 08:53 分类:问答What techniques are there to extract a navigational menu from a web page?
I\'m looking for a method to extract a menu used for navigation from a web page heavy with links (and probably text). The pages I\'m interested in are quite plain, valid XHTML, and it\'s a safe assump[详细]
2023-03-01 15:22 分类:问答Is there a supervised learning algorithm that takes tags as input, and produces a probability as output?
Let\'s say I want to determine the probability that I will upvote a question on SO, based only on which tags are present or absent.[详细]
2023-03-01 09:32 分类:问答How to automatically classify words in the dictionary?
I have a large dictionary file, dic.txt (its actually the SOWPODS) with one word from the English language per line. I want to automatically split this file into 3 different files easy_dic.txt (most c[详细]
2023-02-28 14:41 分类:问答Pls suggest a classifer for this two class problem with memory constraint
Basically I have a device which must be toggled on or off depending on the time. There is a function which checks every 10 minutes and depending on previous data of whether the light was on or off the[详细]
2023-02-27 00:01 分类:问答