text-parsing
Python Parsing: Pull data from html test file for non-standard layout
I need help with parsing an html text file that has a layout that I\'m not sure how to parse through, and could really use the help.[详细]
2023-02-07 01:13 分类:问答Where should I store a list of stop words?
My function parses texts and removes short words, such as \"a\", \"the\", \"in\", \"on\", \"at\", etc.[详细]
2023-02-06 20:06 分类:问答Text file parsing using java, suggestions needed on which one to use
I can successfully read text file using InputFileStream and Scanner classes. It\'s very easy but I need to do something more complex than that. A little background about my project first.. I have a de[详细]
2023-02-05 08:19 分类:问答script to convert date (month_name DD, YYYY) to (YYYY-MM-DD)
I have a text file with dates in the form: \"date=month_name DD, YYYY\" and \"date=(month_name DD, YYYY)\"[详细]
2023-02-04 22:06 分类:问答Apostrophes Converted to Correct Text?
Goal:I need to be able to convert apostrophes to properly formed words. - at least for the most common words with apostrophes.To do this ideally I\'d want a list of words and their implied conterparts[详细]
2023-02-04 21:35 分类:问答read text from file containing escape characters to compile time equivalent
I would like to parse a text file or a java property which contains text such as: \"test\\n123\\t456\"[详细]
2023-02-04 16:38 分类:问答How to extract certain value from collection of text files
Say, I have a collection of text files I need to process (e.g. search for a certain label and extract the value). What would be the general way to tackle the problem?[详细]
2023-02-03 06:36 分类:问答Extracting "((Adj|Noun)+|((Adj|Noun)(Noun-Prep)?)(Adj|Noun))Noun" from Text (Justeson & Katz, 1995)
Is it possible to extract ((Adj|Noun)+|((Adj|Noun)(Noun-Prep)?)(Adj|Noun))Noun proposed by Justeson and Katz (1995) using the R package openNLP?[详细]
2023-02-02 11:06 分类:问答Split alphanumeric string between leading digits and trailing letters
I have a string like: $Order_num = "0982asdlkj"; How can I split that into the 2 variables, with the number as one element and then another variable with the letter element?[详细]
2023-02-01 21:01 分类:问答How to clean up garbage text from string using PHP?
I am trying to parse a word document file. I upload the using PHP then I am trying to get contents using file_get_contents(); function but the problem is when its displayed in front end a lots of garb[详细]
2023-01-31 16:24 分类:问答