Ideone.com

import re
 
urls = ['http://www.stackoverflow.com/lifestyle/tech/this-is-a-very-nice-headline-my-friend/2013/04/26/acjhrjk-2e1-1krjke4-9el8c-2eheje_story.html?tid=sm_fb',
'http://www.stackoverflow.com/2015/07/15/sports/baseball/another-very-nice.html?smid=tw-somedia&seid=auto',
'http://w...content-available-to-author-only...k.com/news/2013/07/22/54216-hello-another-one-here?lite',
'http://w...content-available-to-author-only...k.com/article_email/hello-one-here-that-is-cool-1545545554-lMyQjAxMTAHFJELMDgxWj',
'http://w...content-available-to-author-only...k.com/2013/11/13/tech/tricky-one/the-real-one/index.html',
'http://w...content-available-to-author-only...k.com/2013/11/13/tech/the-good-one.html',
'http://w...content-available-to-author-only...k.com/news/science-and-technology/54512-hello-world-here-is-a-weird-character#b02g07f20b14']
 
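# Path segment: a run of word characters or hyphens that follows a "/" and
# ends at ".", "?", "/", "#" or the end of the URL.
regex = re.compile(r'(?<=/)([-\w]+)(?=[.?/#]|$)')
# A run of three or more digits (an ID or date), with any adjacent hyphens.
digits = re.compile(r'-?\d{3,}-?')
 
for url in urls:
    # Treat the longest path segment as the headline slug, then strip digit runs.
    substrings = regex.findall(url)
    longest = max(substrings, key=len)
    headline = digits.sub('', longest)
    print(headline)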

Success #stdin #stdout 0.01s 9016KB

stdin

Standard input is empty

stdout

this-is-a-very-nice-headline-my-friend
another-very-nice
hello-another-one-here
hello-one-here-that-is-coollMyQjAxMTAHFJELMDgxWj
the-real-one
the-good-one
hello-world-here-is-a-weird-character
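
In the fourth line of the output, removing the digit run joins "cool" to the trailing token "lMyQjAxMTAHFJELMDgxWj". One possible variant, sketched below, splits the longest segment on digit runs and keeps the longest remaining piece instead of deleting the digits. This assumes the headline is always the longest piece, which may not hold for every URL. The function name `extract_headline` and the upper-case pattern names are hypothetical and are not part of the original paste.

```python
import re

# Same two patterns as the original snippet.
SEGMENT = re.compile(r'(?<=/)([-\w]+)(?=[.?/#]|$)')
DIGITS = re.compile(r'-?\d{3,}-?')


def extract_headline(url):
    """Return a best-guess headline slug (hypothetical helper)."""
    segments = SEGMENT.findall(url)
    if not segments:
        return ''
    # Longest path segment, as in the original loop.
    longest = max(segments, key=len)
    # Split on digit runs instead of deleting them, so text on either
    # side of an ID is not glued together.
    pieces = DIGITS.split(longest)
    # Assumption: the headline is the longest remaining piece.
    return max(pieces, key=len)


print(extract_headline(
    'http://w...content-available-to-author-only...k.com/article_email/'
    'hello-one-here-that-is-cool-1545545554-lMyQjAxMTAHFJELMDgxWj'))
```

The code runs under both Python 2.7 and Python 3, since `print` is called with a single argument.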

https://ideone.com/9eHKQt

language:

Python (cpython 2.7.16)

created:

visibility:

public
