Python BeautifulSoup Web Scrape

Python BeautifulSoup Web Scrape

Joe Story (thejoestory)

10 лет назад

29,914 Просмотров

Ссылки и html тэги не поддерживаются


Комментарии:

@JeremyTheDeveloper
@JeremyTheDeveloper - 13.12.2014 03:48

Very clear.  Thanks for linking to the code

Ответить
@cesarzepeda5744
@cesarzepeda5744 - 21.10.2015 07:39

Dope you are helping me learn a lot!

Ответить
@muzzammilmia8060
@muzzammilmia8060 - 17.12.2015 19:27

quick question have you used any other modules apart from BeautifulSoup ?

Ответить
@arie996
@arie996 - 13.02.2016 17:50

Very nice, where are the rest of the videos?

Ответить
@ausername9131
@ausername9131 - 17.02.2016 20:44

Thank you! Just followed along and got this to work with about 2 seconds of previous Python experience. Great lay man's explanations of what each line is doing.

Ответить
@Pghpete5
@Pghpete5 - 09.04.2016 21:57

Would love to see this updated to python 3.x. I know there are only minor differences but for someone learning it makes it difficult to dig thru tutorials in hopes that their 3.x syntax works using 2.x tutorials. I am also very curious as to why python 2.x is still so prevelent. I know some large projects and OS's (RHEL, etc) have both and still rely heavily on python 2 but what are you thoughts on why the addoption is so slow to python3? Trying to pick up a new language where folks still use an old version which very different in some respects can be frustrating at times.

Ответить
@chriscasperian4317
@chriscasperian4317 - 23.05.2016 11:21

not sure if this is a new thing or it's just me
but I've recently tried this again and got a ton of errors
it worked when i changed the following line:

col = row.findAll('td') and changed it to....

col = row.findAll('td', {'class' : 'playertablePlayerName'})

Ответить
@colinsavacool1935
@colinsavacool1935 - 10.09.2016 04:19

I am getting an error on line 11 of the code:
for row in tableStats.findAll('tr')[2:]:

the error message is:
Traceback (most recent call last):
File "ESPN_Mine.py", line 11, in <module>
for row in tableStats.findAll('tr')[2:]:
AttributeError: 'NoneType' object has no attribute 'findAll'

Is there any reason for this?

Ответить
@austin7416
@austin7416 - 20.09.2016 11:30

I had difficulties with this.
1) The screen clarity is poor, so hard to be sure what was being typed. } versus ) or : versus ; for instance.
2) I'm using Python 3.5 Apparantly, this vidoe was done in Python 2. Some commands are different.
import urllib2 is now import urllib
urllib2.urlopen is now urllib.request.urlopen
3) I changed 'html' to 'html5lib'
Note: I installed html5lib first.
4) I changed [2:] to [2] to see what would happen out of curiosity. I got errors. I would like to know more on this.
5) It balked at one of my lines being indented differently. Picky.

Other than that, thanks. I got through it. It was very helpful.

Ответить
@thevibecoder69
@thevibecoder69 - 16.10.2016 14:21

Thanks for this awesome tutorial !

Ответить
@zes3813
@zes3813 - 23.10.2016 19:58

wrr

Ответить
@charliekrajewski3646
@charliekrajewski3646 - 21.01.2017 05:49

Fantastic Joe. It's a practical example helpful in ways that the BSoup documentation isn't.

Ответить
@ruthgerrighart1669
@ruthgerrighart1669 - 05.06.2017 00:19

Hi Joe, Very nice video! You mentioned a demo code, but the link seems to not work. Would be great to have it.

Ответить
@kdineshreddy3254
@kdineshreddy3254 - 25.06.2017 14:30

for row in tableStats.findAll('tr')[2:]:
AttributeError: 'NoneType' object has no attribute 'findAll'

how to deal this error? Can some one help me out?

Ответить
@stevenhymel2364
@stevenhymel2364 - 26.06.2017 01:50

Anyone done this on NFL.com? Trying to scrape each players profile. Any help would be appreciated!

Ответить
@t9footballhd683
@t9footballhd683 - 20.06.2020 15:02

the demo code is not working

Ответить