All Questions
Tagged with html-parsing beautifulsoup
1,063
questions
1
vote
1
answer
28
views
My Beautiful Soup library is not extracting out the all the anchor elements from a listed display
Hi so I am very new to web scraping and I am trying out the basics for it. Right now, I wanted to extract links from a root website (coventry.gov.uk). The problem was, however, I could not get the ...
1
vote
1
answer
78
views
python: parse html document with UNNESTED div tags into dataframe (using beautifulsoup)
long time user, but never had to ask my own question.
I want to use python to parse a table from an html document into a dataframe. The table is NOT an html table, I think it is javascript created ...
1
vote
1
answer
148
views
Trouble finding Youtube view count
following a youtube tutorial on scraping youtube views and video date [https://www.youtube.com/watch?v=Cc3mMH8XWC4]
I made a dataframe of every video, it has the views, clean_views, video_url, ...
0
votes
1
answer
59
views
Select all <table> elements without classes or ids with BeautifulSoup
I am trying to select all <table> elements on some web pages with BeautifulSoup. The table elements do not have specific classes or ids.
import bs4
import requests
def get_keycode_soup(url):
...
0
votes
0
answers
60
views
Python: Replacing Text Split Across HTML Tags
I'm working on a Python function to search for and replace a string within an HTML document, where the string might be broken up by HTML tags. I need a solution that accurately handles these cases ...
0
votes
3
answers
527
views
"No module named bs4"
I am trying to use beautiful soup in VSCode on Windows 10; I was told to import beautiful soup with the line "from bs4 import Beautiful Soup" but I keep getting the error message that the ...
-2
votes
1
answer
58
views
How to Extract Content Inside Each <a href> Tag?
I am working on a project that involves extracting some data from the website. Specifically, I am interested in pulling out the name of each category along with its description.
I have considered ...
0
votes
2
answers
22
views
Python BeautifulSoup4 Finding Attributes
Right now i am building a web scraper to grab the actual href link after the tag to then continue and make a file with all of the values I grab.
I just want to grab the "/groups/1234123" ...
0
votes
0
answers
13
views
Beautiful soup and inspect show different things while scraping [duplicate]
I want to scrape a table from a website. I found all that the 'tr's correspond to the rows of the table, which are under 'table' with class 'StyledTableBody-c11n-8-64-1__sc-8i1s74-0 hLYlju'.
In my ...
0
votes
2
answers
74
views
Why is my code print out the same html link a lot of times?
I'm doing a following link activity on Python ( it's an assignment on Python Web Access Data - Coursera). Here is the problem:
In this assignment you will write a Python program that expands on http:/...
0
votes
0
answers
46
views
HTML code differences when trying to parse code
I am having an issue with HTML parsing a website using Selenium. I have the parent variable (individual message), and their nested classes I am trying to extract (name, content, timestamp). The issue ...
1
vote
1
answer
47
views
Beautiful Soup only gets header of table
I am trying to import the data from a table on this website to a csv:http://www.ameren.com/illinois/residential/supply-choice/renewables/interconnection-queue.
I have tried many different solutions, ...
1
vote
1
answer
32
views
Python: How can i get a list of li tags in BeautifulSoup4
I'm trying to scrape a persian webpage and i want to get 3 li tags from a ul containing 6 of them. my problem is that every li, has nested li tags in it and when i use soup.find_all('li'), it finds ...
-2
votes
1
answer
86
views
'module' object is not callable in python
!pip install bs4
!pip install html5lib
import pandas as pd
!pip install lxml
import requests
import bs4 as BeautifulSoup
url = "https://cf-courses-data.s3.us.cloud-object-storage.appdomain.cloud/...