Hi,
As I couldn't find a newsgroup that specializes in regex, I hope
somebody here can help me.
I have the following kind of strings:
file/start.asp#en:test>top
file/start.asp?load=http:/www.test.com>top
file/test.asp?load=http://www.test.com#en:test;de:test>top
With the following regex I can extract the things I need (everything
between 'en:' and the last ';', or 'top', or the URIs):
((.*)[(/|\\)])(.*\.asp)(\?load=(.*)#)?((.*;en:(.*));)?(>(.*)$)?
But additionally the whole thing can start with http:// or ftp://, e.g.
http://www.start.com?load=http:/www.test.com>top
http://www.start.com?load=http:/www.test.com#en:test;>top
...
But when I modify it like this:
(http://.*|ftp://.*)|((.*)[(/|\\)])(.*\.asp)(\?load=(.*)#)?((.*;en:(.*));)?(>(.*)$)?
it only matches the whole string and doesn't split it into groups.
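The behaviour can be reproduced with a short sketch. It assumes Python's re module, since the post does not say which regex engine is in use. The likely cause is that '|' has the lowest precedence, so the modified pattern is really two complete alternatives, and the first one ('http://.*') greedily consumes the rest of the string before the second one is ever tried. The 'scoped' pattern below is a hypothetical, simplified variant that only illustrates limiting the alternation to the leading part; it is not a complete replacement for the original regex.

```python
import re

# The modified pattern from the post, unchanged.
modified = re.compile(
    r"(http://.*|ftp://.*)|((.*)[(/|\\)])(.*\.asp)"
    r"(\?load=(.*)#)?((.*;en:(.*));)?(>(.*)$)?"
)

s = "http://www.start.com?load=http:/www.test.com#en:test;>top"

m = modified.match(s)
# Group 1 holds the entire input; every other group is unset (None),
# because the first top-level alternative matched on its own.
print(m.group(1))
print(m.groups()[1:])

# Hypothetical simplified variant (an assumption, not the author's regex):
# the alternation is wrapped in a non-capturing group so it covers only the
# leading part, and '.*' is narrowed so it stops at '?', '#' or '>'.
scoped = re.compile(
    r"(?:(http://[^?#>]*|ftp://[^?#>]*)|((.*)[/\\])(.*\.asp))"
    r"(\?load=([^#>]*))?"
)

m2 = scoped.match(s)
print(m2.group(1))  # the leading URL only
print(m2.group(6))  # the value of the load parameter
```

The remaining parts of the original pattern (the 'en:' section and the '>top' suffix) would still have to be appended after the scoped prefix and checked against the real inputs.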
Maybe somebody has a clue; I'm a bit stuck on this at the moment.
Best regards,
Andi