fork download
  1. import re
  2.  
  3. s='''RECORD1 Sed similique nostrum quibusdam minus. Rerum repudiandae et ipsum numquam commodi repellendus. Aut minima ratione vel
  4. beatae minima reprehenderit provident neque. Earum quam temporibus repudiandae quidem officiis
  5. RECORD2 Sed similique nostrum quibusdam minus. Rerum repudiandae et ipsum numquam commodi repellendus. Aut minima ratione vel
  6. beatae minima reprehenderit provident neque. Earum quam temporibus repudiandae quidem officiis
  7. RECORD3 It is a long established fact that a reader will be distracted by the readable content of a page when looking at its
  8. layout.
  9. RECORD4 '''
  10. regex = r"(?s)(RECORD.*?\s).*?(?=(RECORD.*?\s))"
  11. matches = re.finditer(regex, s)
  12. for matchNum, match in enumerate(matches):
  13. matchNum = matchNum + 1
  14. print ("Match {matchNum} was found at {start}-{end}: {match}".format(matchNum = matchNum, start = match.start(), end = match.end(), match = match.group()))
  15. for groupNum in range(0, len(match.groups())):
  16. groupNum = groupNum + 1
  17. print ("Group {groupNum} found at {start}-{end}: {group}".format(groupNum = groupNum, start = match.start(groupNum), end = match.end(groupNum), group = match.group(groupNum)))
Success #stdin #stdout 0.01s 6936KB
stdin
Standard input is empty
stdout
Match 1 was found at 0-223: RECORD1  Sed similique nostrum quibusdam minus. Rerum repudiandae et ipsum numquam commodi repellendus. Aut minima ratione vel 
beatae minima reprehenderit provident neque. Earum quam temporibus repudiandae quidem officiis

Group 1 found at 0-8: RECORD1 
Group 2 found at 223-231: RECORD2 
Match 2 was found at 223-445: RECORD2 Sed similique nostrum quibusdam minus. Rerum repudiandae et ipsum numquam commodi repellendus. Aut minima ratione vel 
beatae minima reprehenderit provident neque. Earum quam temporibus repudiandae quidem officiis

Group 1 found at 223-231: RECORD2 
Group 2 found at 445-453: RECORD3 
Match 3 was found at 445-582: RECORD3   It is a long established fact that a reader will be distracted by the readable content of a page when looking at its 
layout. 

Group 1 found at 445-453: RECORD3 
Group 2 found at 582-590: RECORD4