Newest 'html-parsing+beautifulsoup' Questions

1 vote

1 answer

28 views

My Beautiful Soup library is not extracting all the anchor elements from a listed display

Hi, I am very new to web scraping and am trying out the basics. Right now, I want to extract links from a root website (coventry.gov.uk). The problem, however, is that I could not get the ...

Gs can't

23

asked Jul 7 at 18:02

1 vote

1 answer

78 views

Python: parse HTML document with UNNESTED div tags into DataFrame (using BeautifulSoup)

Long-time user, but I have never had to ask my own question. I want to use Python to parse a table from an HTML document into a DataFrame. The table is NOT an HTML table; I think it is created by JavaScript ...

tailor

15

asked Feb 22 at 23:19

1 vote

1 answer

148 views

Trouble finding YouTube view count

Following a YouTube tutorial on scraping YouTube views and video dates [https://www.youtube.com/watch?v=Cc3mMH8XWC4], I made a DataFrame of every video; it has the views, clean_views, video_url, ...

Rango00

27

asked Feb 18 at 15:23

0 votes

1 answer

59 views

Select all <table> elements without classes or ids with BeautifulSoup

I am trying to select all <table> elements on some web pages with BeautifulSoup. The table elements do not have specific classes or ids. import bs4 import requests def get_keycode_soup(url): ...

will-hedges

1,284

asked Jan 30 at 15:14

0 votes

0 answers

60 views

Python: Replacing Text Split Across HTML Tags

I'm working on a Python function to search for and replace a string within an HTML document, where the string might be broken up by HTML tags. I need a solution that accurately handles these cases ...

Rookleton

19

asked Jan 8 at 9:49

0 votes

3 answers

527 views

"No module named bs4"

I am trying to use Beautiful Soup in VS Code on Windows 10; I was told to import Beautiful Soup with the line "from bs4 import Beautiful Soup" but I keep getting the error message that the ...

John Cleary

9

asked Dec 23, 2023 at 10:12

-2 votes

1 answer

58 views

How to Extract Content Inside Each <a href> Tag?

I am working on a project that involves extracting some data from the website. Specifically, I am interested in pulling out the name of each category along with its description. I have considered ...

Alex VIJU

13

asked Aug 30, 2023 at 19:58

0 votes

2 answers

22 views

Python BeautifulSoup4 Finding Attributes

Right now I am building a web scraper to grab the actual href link after the tag, and then make a file with all of the values I grab. I just want to grab the "/groups/1234123" ...

Notsuj

19

asked Aug 1, 2023 at 21:50

0 votes

0 answers

13 views

Beautiful soup and inspect show different things while scraping [duplicate]

I want to scrape a table from a website. I found that all the 'tr's correspond to the rows of the table, which are under 'table' with class 'StyledTableBody-c11n-8-64-1__sc-8i1s74-0 hLYlju'. In my ...

Quite Orange

1

asked Jul 31, 2023 at 17:21

0 votes

2 answers

74 views

Why does my code print out the same HTML link many times?

I'm doing a link-following activity in Python (it's an assignment for Python Web Access Data on Coursera). Here is the problem: In this assignment you will write a Python program that expands on http:/...

Vinh Nguyễn Thành

1

asked Jul 27, 2023 at 4:44

0 votes

0 answers

46 views

HTML code differences when trying to parse code

I am having an issue parsing a website's HTML using Selenium. I have the parent variable (individual message) and its nested classes that I am trying to extract (name, content, timestamp). The issue ...

CuriousRhino

1

asked Jul 21, 2023 at 10:46

1 vote

1 answer

47 views

Beautiful Soup only gets header of table

I am trying to import the data from a table on this website to a CSV: http://www.ameren.com/illinois/residential/supply-choice/renewables/interconnection-queue. I have tried many different solutions, ...

user22062084

11

asked Jun 12, 2023 at 16:40

1 vote

1 answer

32 views

Python: How can I get a list of li tags in BeautifulSoup4

I'm trying to scrape a Persian webpage and I want to get 3 li tags from a ul containing 6 of them. My problem is that every li has nested li tags in it, and when I use soup.find_all('li'), it finds ...

Seyedmahdi moosavyan

107

asked Jun 12, 2023 at 15:42

-2 votes

1 answer

86 views

'module' object is not callable in Python

!pip install bs4 !pip install html5lib import pandas as pd !pip install lxml import requests import bs4 as BeautifulSoup url = "https://cf-courses-data.s3.us.cloud-object-storage.appdomain.cloud/...

Inam Buneri

1

asked Jun 8, 2023 at 9:17
