text-processing
Estimating the word count of a file without reading the full file
I have a program that processes very large files. Now I need to display a progress bar for the processing. The program works at the word level, reading one line at a time, splitting it into wo...
2023-01-11 23:16 Category: Q&A
Extract snippet out of HTML with Ruby?
I need to show the first 100 characters of an HTML text, which means I have to pick the first 100 characters that are not tags and then close any open tags, leaving balanced HTML. Is there any l...
2023-01-10 08:08 Category: Q&A
How to define syntax
I am new to language processing and I want to create a parser with Irony for the following syntax: name1:value1 name2:value2 name3:value ...
2023-01-10 02:24 Category: Q&A
Perl plotting for multiple datasets in a .csv file
Merged with perl plotting using GD:Graph. I have a .csv file with the following data. (I open the CSV file as a comma-separated file.)
2023-01-07 21:57 Category: Q&A
Splitting words in running text using Python?
I am writing a piece of code that will extract words from running text. This text can contain delimiters such as \r, \n, etc.
2023-01-05 20:05 Category: Q&A
Processing OCRed text
I am extracting text from OCRed TIFF files using a library and dumping it into a database. The text I am extracting is actually forms with fields like NAME, DOB, COUNTRY, etc. Since OCR...
2023-01-03 18:58 Category: Q&A
Reading email content
Hope someone may be able to help. What I am looking to do is create a small WinForms app in C# to read the content of an email from a POP account, and upload key values to a sql automatical...
2023-01-03 03:10 Category: Q&A
Script to fix broken lines in a .txt file?
I'd love to read books properly on my Kindle. To achieve my dream, I need a script to fix broken lines in a txt file.
2023-01-02 16:57 Category: Q&A
Data extraction from source with lots of white space
I'm trying to extract data from: http://www.phillysheriff.com/old_site/properties.html Ideally I'd be able to get a CSV file with the address, ward, price, and square feet. Is there...
2023-01-01 21:40 Category: Q&A
Problem extracting data from a text file
I am new to Python, and I want to extract the data from this format: FBpp0143497 5 151 5 157 PF00339.22 Arrestin_N Domain 1 135 149 83.4 1.1e-23 1 CL0135
2022-12-31 05:07 Category: Q&A