Saturday, 13 August 2022

Day 38 : Scrape Table from a Website using Python

 


import urllib.request

import pandas as pd


#List of publicly listed ITES companies of India

url = "https://en.wikipedia.org/wiki/List_of_publicly_listed_ITES_companies_of_India"


with urllib.request.urlopen(url) as i:

    html = i.read()

    

data = pd.read_html(html)[0]

print(data.head())


#clcoding.com

               Company       Listed  Founded            Revenue  \
0  3i Infotech Limited  BSE: 532628     1993     US$239 million   
1     HCL Technologies  BSE: 532281     1976    US$5.36 billion   
2              Infosys  BSE: 500209     1981    US$8.24 billion   
3    KPIT Technologies  BSE: 532400     1990  US$444.32 million   
4               Mastek  BSE: 523704     1982  US$150.43 million   

                 Profit Headcount Reference  
0  US$84 million (2014)      9000       [1]  
1                   NaN    197777       [2]  
2       US$1.75 billion    169638       [3]  
3                   NaN     10291       [4]  
4       US$8.37 million      3352       [5]  

0 Comments:

Post a Comment

Popular Posts

Categories

AI (33) Android (24) AngularJS (1) Assembly Language (2) aws (17) Azure (7) BI (10) book (4) Books (146) C (77) C# (12) C++ (82) Course (67) Coursera (198) Cybersecurity (24) data management (11) Data Science (106) Data Strucures (8) Deep Learning (13) Django (14) Downloads (3) edx (2) Engineering (14) Excel (13) Factorial (1) Finance (6) flask (3) flutter (1) FPL (17) Google (21) Hadoop (3) HTML&CSS (47) IBM (25) IoT (1) IS (25) Java (93) Leet Code (4) Machine Learning (46) Meta (18) MICHIGAN (5) microsoft (4) Nvidia (1) Pandas (3) PHP (20) Projects (29) Python (893) Python Coding Challenge (285) Questions (2) R (70) React (6) Scripting (1) security (3) Selenium Webdriver (2) Software (17) SQL (42) UX Research (1) web application (8)

Followers

Person climbing a staircase. Learn Data Science from Scratch: online program with 21 courses