Skip to main content

All Questions

1 vote
1 answer
28 views

My Beautiful Soup library is not extracting out the all the anchor elements from a listed display

Hi so I am very new to web scraping and I am trying out the basics for it. Right now, I wanted to extract links from a root website (coventry.gov.uk). The problem was, however, I could not get the ...
Gs can't's user avatar
1 vote
1 answer
78 views

python: parse html document with UNNESTED div tags into dataframe (using beautifulsoup)

long time user, but never had to ask my own question. I want to use python to parse a table from an html document into a dataframe. The table is NOT an html table, I think it is javascript created ...
tailor's user avatar
  • 15
1 vote
1 answer
148 views

Trouble finding Youtube view count

following a youtube tutorial on scraping youtube views and video date [https://www.youtube.com/watch?v=Cc3mMH8XWC4] I made a dataframe of every video, it has the views, clean_views, video_url, ...
Rango00's user avatar
  • 27
0 votes
1 answer
59 views

Select all <table> elements without classes or ids with BeautifulSoup

I am trying to select all <table> elements on some web pages with BeautifulSoup. The table elements do not have specific classes or ids. import bs4 import requests def get_keycode_soup(url): ...
will-hedges's user avatar
  • 1,284
0 votes
0 answers
60 views

Python: Replacing Text Split Across HTML Tags

I'm working on a Python function to search for and replace a string within an HTML document, where the string might be broken up by HTML tags. I need a solution that accurately handles these cases ...
Rookleton's user avatar
0 votes
3 answers
526 views

"No module named bs4"

I am trying to use beautiful soup in VSCode on Windows 10; I was told to import beautiful soup with the line "from bs4 import Beautiful Soup" but I keep getting the error message that the ...
John Cleary's user avatar
-2 votes
1 answer
58 views

How to Extract Content Inside Each <a href> Tag?

I am working on a project that involves extracting some data from the website. Specifically, I am interested in pulling out the name of each category along with its description. I have considered ...
Alex VIJU's user avatar
0 votes
2 answers
22 views

Python BeautifulSoup4 Finding Attributes

Right now i am building a web scraper to grab the actual href link after the tag to then continue and make a file with all of the values I grab. I just want to grab the "/groups/1234123" ...
Notsuj's user avatar
  • 19
0 votes
0 answers
13 views

Beautiful soup and inspect show different things while scraping [duplicate]

I want to scrape a table from a website. I found all that the 'tr's correspond to the rows of the table, which are under 'table' with class 'StyledTableBody-c11n-8-64-1__sc-8i1s74-0 hLYlju'. In my ...
Quite Orange's user avatar
0 votes
2 answers
74 views

Why is my code print out the same html link a lot of times?

I'm doing a following link activity on Python ( it's an assignment on Python Web Access Data - Coursera). Here is the problem: In this assignment you will write a Python program that expands on http:/...
Vinh Nguyễn Thành's user avatar
0 votes
0 answers
46 views

HTML code differences when trying to parse code

I am having an issue with HTML parsing a website using Selenium. I have the parent variable (individual message), and their nested classes I am trying to extract (name, content, timestamp). The issue ...
CuriousRhino's user avatar
1 vote
1 answer
47 views

Beautiful Soup only gets header of table

I am trying to import the data from a table on this website to a csv:http://www.ameren.com/illinois/residential/supply-choice/renewables/interconnection-queue. I have tried many different solutions, ...
user22062084's user avatar
1 vote
1 answer
32 views

Python: How can i get a list of li tags in BeautifulSoup4

I'm trying to scrape a persian webpage and i want to get 3 li tags from a ul containing 6 of them. my problem is that every li, has nested li tags in it and when i use soup.find_all('li'), it finds ...
Seyedmahdi moosavyan's user avatar
-2 votes
1 answer
86 views

'module' object is not callable in python

!pip install bs4 !pip install html5lib import pandas as pd !pip install lxml import requests import bs4 as BeautifulSoup url = "https://cf-courses-data.s3.us.cloud-object-storage.appdomain.cloud/...
Inam Buneri's user avatar
-1 votes
1 answer
41 views

Web scrapping Dermnet Images

trying to web scrape images from text but can't access the div block using Beutiful soup any solutions ? I also tried usng selenium but its still not working.The website is using sava script to ...
sanchay vashist's user avatar

15 30 50 per page
1
2 3 4 5
71