Create an application that has a text field to enter a URL (default value: https://raw.githubusercontent.com/mwaskom/seaborn-data/master/penguins.csv). The app should read the CSV file (and cache it) from the URL and display a DataFrame. Add controls to do the following: - Control how many rows are displayed - A checkbox to determine to randomly shuffle the DataFrame Users select one or two variables and you must determine the best way to visualize them.
To upload files, please first save the app
import streamlit as st
import pandas as pd
import numpy as np
@st.cache_data
def load_data(url):
return pd.read_csv(url)
st.title('Penguins Data Visualization')
url = st.text_input('Enter CSV URL:', 'https://raw.githubusercontent.com/mwaskom/seaborn-data/master/penguins.csv')
# Load Data
try:
df = load_data(url)
except Exception as e:
st.error(f'Error loading data: {e}')
df = pd.DataFrame() # Empty DataFrame in case of error
# Controls to control number of displayed rows and shuffling
num_rows = st.slider('Select number of rows to display:', min_value=1, max_value=len(df), value=10)
shuffle = st.checkbox('Shuffle DataFrame')
if shuffle:
df = df.sample(frac=1).reset_index(drop=True)
# Display DataFrame
st.write(f'Displaying {num_rows} rows:')
st.dataframe(df.head(num_rows))
# User selects columns to visualize
columns = df.select_dtypes(include=[np.number]).columns.tolist()
if len(columns) < 2:
st.warning('Not enough numeric columns to visualize.')
else:
x_variable = st.selectbox('Select the X variable:', columns)
y_variable = st.selectbox('Select the Y variable:', columns)
# Create scatter plot if both variables are selected
if x_variable and y_variable:
st.subheader(f'Scatter Plot of {y_variable} vs {x_variable}')
st.line_chart(df[[x_variable, y_variable]].dropna())
Hi! I can help you with any questions about Streamlit and Python. What would you like to know?