I'm looking for a pattern in Regex in Python to do the following:
For a text formatted like:
2021-01-01 10:00:05 - Surname1 Name1 (Comment)
Blablabla
Blabla
2021-01-01 23:00:05 - Surname2 SurnameBis Name2 (WorkNotes)
What?
I don't know?
2021-01-02 03:00:05 - Surname1 Name1 (Comment)
Blablabla!
I would like to return:
[(2021-01-01,10:00:05,Surname1 Name1,Comment,Blablabla/nBlabla),
(2021-01-01,23:00:05,Surname2 SurnameBis Name2,WorkNotes,What?/nI don't know?),
(2021-01-02,03:00:05,Surname1 Name1,Comment,Blablabla!)]
I managed to find a quiet close result with:
text2 = """2021-01-01 10:00:05 - Surname1 Name1 (Comment)
Blablabla
Blabla
2021-01-01 23:00:05 - Surname2 SurnameBis Name2 (WorkNotes)
What?
I don't know?
Can you be clear?
2021-01-02 03:00:05 - Surname1 Name1 (Comment)
Blablabla!"""
LangTag = re.findall("(\d{4}-\d{2}-\d{2})\s(\d{2}:\d{2}:\d{2})\s-\s(.*?)\((.*)\)\\n(.*)(?:\\n|$)", text2)
print(LangTag)
But I'm totally stuck to make appears the entire text I need to get...

a solution can be to remove the \n from initial text but, I would like to avoid it because I need them later on... Any idea?