Create an application that has a text field to enter a URL (default value: https://raw.githubusercontent.com/mwaskom/seaborn-data/master/penguins.csv). The app should read the CSV file (and cache it) from the URL and display a DataFrame. Add controls to do the following: - Control how many rows are displayed - A checkbox to determine to randomly shuffle the DataFrame Users select one or two variables and you must determine the best way to visualize them.
To upload files, please first save the app
import streamlit as st
import pandas as pd
@st.cache_data
def load_data(url):
return pd.read_csv(url)
st.title('Penguins Data Viewer')
url = st.text_input('Enter URL of the CSV file:',
value='https://raw.githubusercontent.com/mwaskom/seaborn-data/master/penguins.csv')
# Load data from the provided URL
try:
df = load_data(url)
st.write('Data loaded successfully!')
except Exception as e:
st.error(f'Error loading data: {e}')
# Checkbox for shuffling data
shuffle = st.checkbox('Shuffle DataFrame')
if shuffle:
df = df.sample(frac=1).reset_index(drop=True)
# Control for number of rows displayed
num_rows = st.number_input('Number of rows to display:', min_value=1,
max_value=len(df), value=5)
# Display the DataFrame
st.write(df.head(num_rows))
# Variable selection for visualization
variables = df.select_dtypes(include=['object', 'number']).columns.tolist()
selected_vars = st.multiselect('Select variables to visualize:', variables, max_selections=2)
if len(selected_vars) == 2:
st.write('Creating a scatter plot...')
st.pyplot(df.plot.scatter(x=selected_vars[0], y=selected_vars[1]))
elif len(selected_vars) == 1:
st.write('Creating a histogram...')
st.pyplot(df[selected_vars[0]].hist())
Hi! I can help you with any questions about Streamlit and Python. What would you like to know?