Daniel Bosah wrote:
> new_list = [x.encode('latin-1') for x in sorted(paul)]
I don't see why you would need bytes
> search = "(" + b"|".join(new_list).decode() + ")" + "" #re.complie needs
when your next step is to decode it. I'm not sure why it even works as the
default encoding is usually
On 20/06/18 20:32, Daniel Bosah wrote:
> reg = pattern.findall(str(soup))
>
> for i in reg:
> if i in reg and paul: # this loop checks to see if elements are in
> both the regexed parsed list and the list.
No it doesn't. It checks if i is in reg and
if paul is non empty - which it
On 20/06/18 20:32, Daniel Bosah wrote:
> # coding: latin-1
> from bs4 import BeautifulSoup
> from urllib.request import urlopen
> import re
>
> #new point to add... make rest of function then compare a list of monuments
> notaries ( such as blvd, road, street, etc.) to a list of words containing
# coding: latin-1
from bs4 import BeautifulSoup
from urllib.request import urlopen
import re
#new point to add... make rest of function then compare a list of monuments
notaries ( such as blvd, road, street, etc.) to a list of words containing
them. if contained, pass into new set ( ref notes in