import requests
from bs4 import BeautifulSoup
import time
import schedule
def access():
    """Fetch the marketplace sales-ads page and cache the raw HTML on disk.

    Writes the response body to 'sales-ads.html'. Network and HTTP
    failures are printed and swallowed so the scheduler loop keeps running.
    """
    url = 'https://f...content-available-to-author-only...e.com/c/marketplace/sales-ads/'
    try:
        # timeout prevents a hung connection from stalling the scheduler forever
        r = requests.get(url, timeout=30)
        r.raise_for_status()
        # context manager guarantees the file handle is closed even on error
        with open('sales-ads.html', 'wb') as out:
            out.write(r.content)
    except requests.exceptions.RequestException as err:
        # RequestException covers HTTPError plus connection/timeout errors,
        # which the original HTTPError-only handler let escape and crash the loop
        print(err)
def extraction():
    """Parse the cached sales-ads page and write topic titles + URLs to topics.txt.

    Reads 'sales-ads.html' (produced by access()); each matching anchor is
    written as one "title: href" line. Skips the run if the cache is missing.
    """
    try:
        # explicit encoding avoids platform-dependent default decoding
        with open('sales-ads.html', encoding='utf-8') as file:
            src = file.read()
    except FileNotFoundError:
        # access() may not have produced the page yet (or failed); don't crash
        print('sales-ads.html not found; skipping extraction')
        return
    soup = BeautifulSoup(src, 'lxml')
    with open('topics.txt', 'w', encoding='utf-8') as f:
        topic_names = soup.find_all('a', class_='title raw-link raw-topic-link')
        for item in topic_names:
            item_text = item.text
            item_url = item.get('href')
            print(f"{item_text}: {item_url}", file=f)
# Re-scrape and re-parse every 5 seconds. access is registered first, so it
# is expected to run before extraction when both fall due on the same tick
# (schedule runs due jobs in order) — TODO confirm against the schedule docs.
schedule.every(5).seconds.do(access)
schedule.every(5).seconds.do(extraction)

if __name__ == '__main__':
    # Guard lets the module be imported (e.g. for testing) without
    # starting the infinite polling loop as an import side effect.
    while True:
        schedule.run_pending()
        time.sleep(1)