How to extract the data from from the html ?
from urllib.request import urlopen
url = 'http://book.ponniyinselvan.in/part-1/chapter-1.html'
page = urlopen(url)
getting HTTPError: HTTP Error 403: Forbidden
I am trying to extract the data into CSV file.

User-Agentheader to get around that. But this is a book of prose. How are going to shove that into a CSV? It doesn't seems like a smart approach.