data-mining
random unit vector in multi-dimensional space
I'm working on a data mining algorithm where I want to pick a random direction from a particular point in the feature space. [Details]
2023-03-11 03:27 Category: Q&A
Extract to MongoDB for analysis
I have a relational database with about 300M customers and their attributes from several perspectives (360). [Details]
2023-03-10 01:12 Category: Q&A
Generating counts from closed frequent itemsets
I am reading a note that seems to say: given the collection of all closed frequent itemsets and their support counts, the support count of any frequent itemset can be obtained. [Details]
2023-03-08 01:51 Category: Q&A
Outlier detection in data mining [closed]
Closed. This question is off-topic. It is not currently accepting answers. [Details]
2023-03-06 15:05 Category: Q&A
Adding CURE clustering algorithm to WEKA
I have written a Java program to perform CURE clustering. I wish to add this program to Weka as a clustering algorithm and visualize the clustering. [Details]
2023-03-06 13:59 Category: Q&A
In TeamCity, is there a way of seeing a report of tests ordered by failed-most-often across the whole history?
We have some unreliable tests, unreliable because of environmental reasons. We'd like to see a history of which tests have failed the most often, so we can drill into why and fix the environment is... [Details]
2023-03-05 03:10 Category: Q&A
How do you perform bootstrapping and remove outliers in Weka?
I am just starting to play around with the Weka API and a couple of the example data sets, but just wanted to understand a couple of bits and pieces. Does anyone know how to perform 0.632 bootstrapping i... [Details]
2023-03-05 02:49 Category: Q&A
Extracting information from millions of simple but inconsistent text files
We have millions of simple txt documents containing various data structures we extracted from PDF. The text is printed line by line, so all formatting is lost (because when we tried tools to maintain t... [Details]
2023-03-04 12:07 Category: Q&A
Data Mining Resources for C#
I wonder if we could compile a list of resources for Data Mining in C#? Specifically I am looking for... [Details]
2023-03-04 11:54 Category: Q&A
Reducing Dimension of datasets and implementation [closed]
It's difficult to tell what is being asked here. This question is ambiguous, vague, incomplete, overly broad, or rhetorical and cannot be reasonably answered in its current form. For help clari... [Details]
2023-03-03 21:01 Category: Q&A