problem description
I. From paragraph 1, extract the operating income, the net profit attributable to the owner of the parent company, and the net profit attributable to the owner of the parent company after deducting non-recurring profits and losses. Each item should be a tuple of the form (date, name, value, unit), with the date formatted as 2017-9-30, and the tuples stored in a list
II. From paragraph 2, extract the registered capital, that is, (2015, registered capital, 7900.00 US dollars), and store it in a list
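To make the target format concrete, here is a minimal hand-written sketch of what the list for item I might look like. It is written for Python 3. The `Record` name and the English name and unit strings are my own placeholders, not part of the task; the figures are the ones quoted in paragraph 1 of the text content.

```python
from collections import namedtuple

# Hypothetical helper type: a namedtuple is still a plain tuple, so it
# satisfies the (date, name, value, unit) requirement while naming the fields.
Record = namedtuple("Record", ["date", "name", "value", "unit"])

# Written out by hand, not extracted; name and unit strings are assumptions.
results = [
    Record("2017-9-30", "operating income", "987.6899", "million yuan"),
    Record("2017-9-30",
           "net profit attributable to the owner of the parent company",
           "140.7998", "million yuan"),
    Record("2017-9-30",
           "net profit attributable to the owner of the parent company "
           "after deducting non-recurring profits and losses",
           "136.49", "million yuan"),
]

for record in results:
    print(record.date, record.name, record.value, record.unit)
```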
source of the problem and my own ideas
text content:
1. The operating income of the company from January to September 2017, the net profit attributable to the owner of the parent company and the net profit attributable to the owner of the parent company after deducting non-recurring profits and losses were 987.6899 million yuan, 140.7998 million yuan and 136.49 million yuan respectively, up 10.27%, 35.63% and 20.30% from January to September 2016, respectively, reflecting the good operation of the company.
2. The main reasons for the company's higher management expenses in 2015 than in 2014 and lower management expenses in 2016 compared with 2015 are as follows: in March 2015, the company's registered capital increased from US $79 million to US $91.61256 million. Ningbo Lingqi, Ningbo Lingxi, Ningbo Lingqi, Ningbo Lingxin and Ningbo Lingxi are the employee shareholding platforms of the company, according to the company's December 2014
related code
# -*- coding: utf-8 -*-
import re
import json
data = ["2017-9-30"]
list = []
file = open(u".txt")
key = file.read().decode("gbk")
print key
pattern = re.compile(u"(?<=1-9).*(?=)")
matcher = re.search(pattern,key)
pass
pattern1 = re.compile(u"")
matcher1 = re.search(pattern1,matcher.group())
data.append(matcher1.group())
pass
pattern2 = re.compile(u"[0-9].*?(?=)")
matcher2 = re.search(pattern2,matcher.group())
data.append(matcher2.group())
data.append("")
data = tuple(data)
list.append(data)
pass
data2 = ["2017-9-30"]
pattern3 = re.compile(u"")
matcher3 = re.search(pattern3,matcher.group())
data2.append(matcher3.group())
pass
pattern4 = re.compile(u"(?<=).*?(?=)")
matcher4 = re.search(pattern4,matcher.group())
data2.append(matcher4.group())
data2.append("")
data2 = tuple(data2)
list.append(data2)
pass
data3 = ["2017-9-30"]
pattern5 = re.compile(u"")
matcher5 = re.search(pattern5,matcher.group())
data3.append(matcher5.group())
pass
pattern6 = re.compile(u"(?<=).*?(?=)")
matcher6 = re.search(pattern6,matcher.group())
data3.append(matcher6.group())
data3.append("")
data3 = tuple(data3)
list.append(data3)
pass
pass
dataTwo = []
patternTwo = re.compile(u"(?<=).*")
matcherTwo = re.search(patternTwo,key)
pass
patternTwo1 = re.compile(u"2015")
matcherTwo1 = re.search(patternTwo1,matcherTwo.group())
dataTwo.append(matcherTwo1.group())
pass
patternTwo2 = re.compile(u"")
matcherTwo2 = re.search(patternTwo2,matcherTwo.group())
dataTwo.append(matcherTwo2.group())
pass
patternTwo3 = re.compile(u"(?<=).*?(?=)")
matcherTwo3 = re.search(patternTwo3,matcherTwo.group())
dataTwo.append(matcherTwo3.group())
dataTwo.append("")
dataTwo = tuple(dataTwo)
list.append(dataTwo)
for i in range(0,4):
    print "\n"
    for j in list[i]:
        print j,
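The compile/search/append steps above repeat for every field, and `re.search` returns `None` when nothing matches, so a direct `.group()` call would raise `AttributeError`. One way to fold the repetition into a helper is sketched below, written for Python 3. The names `search_group` and `build_record` are hypothetical, and the sample string and patterns are English stand-ins for the real text.

```python
import re


def search_group(pattern, text, default=None):
    """Return the first match of pattern in text, or default if none."""
    match = re.search(pattern, text)
    return match.group() if match else default


def build_record(date, text, name_pattern, value_pattern, unit):
    """Build one (date, name, value, unit) tuple from two patterns."""
    return (
        date,
        search_group(name_pattern, text),
        search_group(value_pattern, text),
        unit,
    )


# Shortened English stand-in for the sentence being searched.
sample = "operating income were 987.6899 million yuan"

record = build_record(
    "2017-9-30",
    sample,
    r"operating income",   # pattern for the item name
    r"\d+(?:\.\d+)?",      # first integer or decimal number
    "million yuan",
)
print(record)
```

A field whose pattern does not match comes back as `None` instead of stopping the script, which makes it easier to see which pattern needs adjusting.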
what result did you expect? what error message did you actually see?
Although the code now extracts the data as required, the approach feels clumsy. I would appreciate any advice from the experts here on how to improve it.
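One direction I have been considering, as a rough sketch only: since the sentence lists the three items and then the three amounts in the same order ("respectively"), a single `re.findall` can collect all amounts and `zip` can pair them with the names. The code below is Python 3 and runs against the English rendering of the two paragraphs; the constant names, function names and patterns are my own assumptions, and the real text would need the original wording in the patterns.

```python
import re

PARAGRAPH_1 = (
    "The operating income of the company from January to September 2017, "
    "the net profit attributable to the owner of the parent company and "
    "the net profit attributable to the owner of the parent company after "
    "deducting non-recurring profits and losses were 987.6899 million yuan, "
    "140.7998 million yuan and 136.49 million yuan respectively, "
    "up 10.27%, 35.63% and 20.30% from January to September 2016."
)
PARAGRAPH_2 = (
    "in March 2015, the company's registered capital increased from "
    "US $79 million to US $91.61256 million."
)

# Item names in the order they appear in paragraph 1.
METRIC_NAMES = [
    "operating income",
    "net profit attributable to the owner of the parent company",
    "net profit attributable to the owner of the parent company "
    "after deducting non-recurring profits and losses",
]

# A number directly followed by the unit; years and percentages do not match.
AMOUNT_RE = re.compile(r"(\d+(?:\.\d+)?) (million yuan)")
# Year, then the capital before the increase.
CAPITAL_RE = re.compile(
    r"(\d{4}), the company's registered capital increased from "
    r"US \$(\d+(?:\.\d+)?) (million)"
)


def extract_metrics(text, date="2017-9-30"):
    """Pair each metric name with the amount in the same position."""
    amounts = AMOUNT_RE.findall(text)
    return [(date, name, value, unit)
            for name, (value, unit) in zip(METRIC_NAMES, amounts)]


def extract_capital(text):
    """Return (year, name, value, unit) for the registered capital, or None."""
    match = CAPITAL_RE.search(text)
    if match is None:
        return None
    year, value, scale = match.groups()
    return (year, "registered capital", value, scale + " US dollars")


records = extract_metrics(PARAGRAPH_1)
capital = extract_capital(PARAGRAPH_2)
if capital is not None:
    records.append(capital)

for record in records:
    print(record)
```

Pairing by position only works while the sentence keeps the names and amounts in the same order, so this is not a general solution, but it replaces the six separate patterns for paragraph 1 with one.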