bigdata
SQL: Joins vs Denormalization (lots of data)
I know, variations of this question had been asked before. But my case may be a little different :-) So, I am building a site that tracks events. Each event has id and value. 开发者_高级运维It is als[详细]
2023-04-12 15:42 分类:问答Handling large String lists in java
I\'ve got a task, where I\'ve got to go through several billion string lines and check, whether each of those is unique. All the lines themselves cannot be accommodated within the RAM memory of the PC[详细]
2023-04-10 04:48 分类:问答Big Data convert to "transactions" from arules package
The arules pa开发者_Go百科ckage in R uses the class \'transactions\'. So in order to use the function apriori() I need to convert my existing data. I\'ve got a Matrix with 2 columns and roughly 1.6mm[详细]
2023-04-01 14:10 分类:问答sorting large text data
I have a large file (100 million lines of tab separated values - about 1.5GB in size). What is the fastest开发者_StackOverflow中文版 known way to sort this based on one of the fields?[详细]
2023-03-28 16:26 分类:问答High Level Java Optimization
There are many questions and answers and opinions about how to do low level Java optimization, with for, while, and do-while loops, and whether it\'s even necessary.[详细]
2023-03-27 16:11 分类:问答Switch from Mysql to MongoDB 200 millions rows
We are trying to move from mysql to mongodb. mysql structure is id_src int id_dest int unique key : id_src,id_dest[详细]
2023-03-23 05:21 分类:问答Pyrocache vs Memcached
I have used PyroCMS for some projects, I love it so much. I am current developing another website based on it. In my website, I need to work on a big database, it is not really big but big enough to r[详细]
2023-03-21 17:21 分类:问答Rendering massive amount of data
I have a 3D floating-point matrix, in worst-case scenario the size could be (200000x1000000x100), I want to visualize this matrix using Qt/OpenGL.[详细]
2023-03-20 01:51 分类:问答NoSQL Recommendation for a specific need
I\'m using a MySql table with the following simple structure : ID_A : int 8 ID_B : int 8 Primary key : ID_A, ID_B[详细]
2023-03-19 17:44 分类:问答How to take the average of big data in MongoDB vs CouchDB?
I\'m looking at this chart... http://www.mongodb.org/display/DOCS/MongoDB,+CouchDB,+MySQL+Compare+Grid[详细]
2023-03-19 16:57 分类:问答