data-mining
DBSCAN algorithm and clustering algorithm for data mining
How do you i开发者_运维技巧mplement DBSCAN algorithm on categorical data (mushroom data set)?[详细]
2023-02-26 21:34 分类:问答How do the Sho dll's from Microsoft Research compare to the open-source Math.NET numerics project
I\'m considering Sho from the perspective of developing general .NET applications and something that has been difficult for me in the past is开发者_Python百科 that there is NO standard math library fo[详细]
2023-02-26 03:23 分类:问答Association Rule Mining on a FOAF dataset of social networks
I am working on a project called \"association rule discovery from social network data: Introducing Data Mining to the Semantic Web\". Can anyone suggest a good source for an algorithm (and its code.[详细]
2023-02-26 03:11 分类:问答Searching for effective way to store graph with 3 million vertices in MySQL
The goal is to make many cycled chains in graph with 3 million vertices. The question is how to store edges in MySQL database and maintain fast speed, searching cycled ch开发者_如何学编程ains, using[详细]
2023-02-25 07:45 分类:问答How to organize data for Mutllevel modeling - Decision Tree, Classification, or Regression
I have three tables - Sales Manager, Customer, and Order. Each sales manager has multiple customers, and each customer can have multiple orders.[详细]
2023-02-25 06:03 分类:问答How do you represent data records containing non-numerical features as vectors (mathematical, NOT c++ vector)?
Many data mining algorithms/strategies use vector representation of data records in order to simulate a spatial representation of the data (like support vector machines).[详细]
2023-02-23 08:17 分类:问答IN WEB scripting specially in PHP What could be the possible information we can store on the client?
Sessions and cookies are the basic storing of session but are there any several ways to store an information on client temporary files or maybe on its browser?[详细]
2023-02-23 04:49 分类:问答Tool for text classification
I am interested in learning about text classification so is reading up on the theory. Next step is doing stuff and therefore I am looking for and at different tools. Some links point to WEKA, however[详细]
2023-02-23 01:31 分类:问答Classify documents with tags
I have a huge amount of documents (mainly pdfs and doc\'s) I want to classify, so I can search over them according to certain tags. These tags could either be of my own (I put the tags to the document[详细]
2023-02-22 18:29 分类:问答Matching Based on Arbitrary Categories and Similarity Measures
开发者_Python百科I have customer database who have certain attributes, and a customer type. The collection of attributes can vary (they do come from a finite set though), and when I look at a new cust[详细]
2023-02-22 04:52 分类:问答