stanfordjournalism / data-journalism-notebooks Goto Github PK
View Code? Open in Web Editor NEWData journalism notebooks for Stanford Graduate Journalism Program
Home Page: https://stanfordjournalism.github.io/data-journalism-notebooks/
Data journalism notebooks for Stanford Graduate Journalism Program
Home Page: https://stanfordjournalism.github.io/data-journalism-notebooks/
Create screencasts about each step in the workflow and submission process:
Links on Python TOC should point to JupyterLite.
Downloading files tutorial should Open in Codespaces
These should mirror and expand upon the skills covered in 1st Jupyter NB, and fill in missing gaps such as reshaping data, stacking data, etc.
Update Syntax Crash Course to include a brief section on accessing items in lists using position/index.
Update python_overview.ipynb as detailed below:
Flesh out in more detail to mention:
TK - GitHub CodeSpaces
Explain how WebAssembly is now making it possible to run Python itself directly in your browser. This means that you don't need to install the software, nor do you need to use a third party such as Google to host the notebook for you.
It's a super-convenient way to learn without having to slog through the process of setting up your own local installation of Python, Jupyter and related libraries.
However, there are drawbacks. These installations are not intended for handling large quantities of data, and there are limitations and friction points when it comes to saving work and normal day-to-day usages of Python, such as idiosyncratic workflows for the very common case of obtaining files from other websites, e.g. when scraping a government agency for data or documents.
In our opinion, there's a time and a place for each of these different coding contexts.
JupyterLite -- ie Python in your Browser -- is a great way to start ramping up immediately. It's so handy that the First Python Notebook is actually a JupyterLite instance that requires no installation of Python or related libraries for you to get started.
But when you're working on projects, we prefer other options. A plain old code editor is handy for whipping up Python scripts or multi-step pipelines which need to run on a regular schedule on a virtual machine in the cloud. These types of machines typically have no graphical interface, and while you can run Jupyter Notebooks as scripts in a shell, it's far more common and convenient to use plain old Python scripts.
For data analysis, we of course recommend Jupyter Notebooks/Lab, either running in your browser or using a third party provider such as Google Colab.
When starting out, it can be tempting to choose convenience (e.g. Google Colab) over learning the slightly harder but more standard way of doing things. In this course, we'll take the latter route, primarily because we want you to learn standard workflows that most teams in the news use, and many of the tutorials and blog posts assume out on the wider Internet. That said, we're very excited about CodeSpaces, which combine standard workflows with a zero setup environment based entirely in the cloud. While it too has limitations in terms of pricing and resources, it's a convenient way to get up and running on real work, using standard practices.
Last but not least, even the humble Python interpreter in your shell can be handy for quickly testing out code snippets and exploring a library, without the overhead of having to install and run a Jupyter Notebook.
Folks are accidentally creating files in hidden config directories rather than the project root using Codespaces.
Create a screencast that shows:
git mv
Can we update the build process so that GH Pages serves both JupyterLite notebooks and JupyterBook style Markdown files?
" one two three ".strip().upper().split()
First, do some googling to see if one exists.
If not, write an extension (likely need a Server Side component since it requires using a secret API key for ChatGPT):
Add animals.txt, FDIC banks list, etc.
Remove fake_file.csv
This might be a better alternative to JupyterLite, since it avoids the limitations with respect to fetching external data due to CORS restrictions in Pyodide
Python basics
and Data Analysis
requests
simply doesn't work and pyodide's pyfetch can't call sites other than the origin due to CORS restrictions. Basically, the only option is to use a file that emanates from the same domain or is packaged along with the JupyterLite buildA declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.