matt harrison mattharrison

Python and Data Science trainer and consultant

mattharrison / markov.py

Last active September 24, 2019 19:50

markov chain starter code

	"""
	This is a module docstring. It must be at the TOP
	of the file.

	This is the markov module. You can create
	a markov chain like this:

	>>> m = Markov('ab')
	>>> m.predict('a')
	'b'

mattharrison / scrolltop.js

Created May 31, 2019 20:06

Jupyter cell to create shortcut (ctl-l) that toggles between scrolling to top of cell and output of cell

	%%javascript
	var selectedCell = 0;
	var viewOutput = false;


	Jupyter.keyboard_manager.command_shortcuts.add_shortcut('ctrl-l', function (event) {
	function getOutputScrollValue(cell) {
	var percent = 0;
	var ct = cell.output_area.element.offset().top;
	var sme = Jupyter.notebook.scroll_manager.element;

mattharrison / black_doctest.py

Last active June 22, 2020 20:48

This script runs black on a text file with doctest and blackens them....

	import argparse
	import sys


	import black
	from blib2to3.pgen2.tokenize import TokenError

	TEST_DATA = """
	Normal

mattharrison / gist:9504e38aeaf1d5479b1d3369ab080a57

Last active September 23, 2021 14:23

Load mushrooms

	mush_df = pd.read_csv('../data/agaricus-lepiota.data.txt',
	names='class,cap_shape,cap_surface,cap_color,bruises,'
	'odor,g_attachment,g_spacing,g_size,g_color,s_shape,'
	's_root,s_surface_a,s_surface_b,s_color_a,s_color_b,'
	'v_type,v_color,ring_num,ring_type,spore_color,pop,hab'.split(','))
	mush_df = pd.get_dummies(mush_df, columns=mush_df.columns).drop(['class_e'],axis=1)

mattharrison / gist:16f396ab992d7275c4a080078b9062b2

Created September 27, 2018 18:30

Load mushrooms

	cols = 'class,cap_shape,cap_surface,cap_color,bruises,'\
	'odor,g_attachment,g_spacing,g_size,g_color,s_shape,'\
	's_root,s_surface_a,s_surface_b,s_color_a,s_color_b,'\
	'v_type,v_color,ring_num,ring_type,spore_color,pop,hab'.split(',')
	mush_df = pd.get_dummies(pd.read_csv('../data/agaricus-lepiota.data.txt', names=cols))
	mush_df

mattharrison / Pytest Intro Assignments.rst

Last active October 25, 2021 18:57

O'Reilly Pytest Class

Pytest Introduction

@__mharrison__

mattharrison / loadnino.py

Last active August 27, 2021 16:36

Load nino dataset

	# Data transformation from previous notebook
	# col names in tao-all2.col from website
	names = '''obs
	year
	month
	day
	date
	latitude
	longitude
	zon.winds

mattharrison / gist:7e78e7f7efc3e66312365e7774f5cc6d

Created July 3, 2018 16:22

Load nino

	# Data transformation from previous notebook
	# col names in tao-all2.col from website
	names = '''obs
	year
	month
	day
	date
	latitude
	longitude
	zon.winds

mattharrison / gist:0e162315c216ef8f57afd833bba8e5a2

Created June 5, 2018 17:29

	%%time
	cols = ['obs', 'year', 'month', 'day', 'date', 'latitude', 'longitude',
	'zon.winds', 'mer.winds', 'humidity', 'air temp.', 's.s.temp.']
	nino = pd.read_csv('data/tao-all2.dat.gz', sep=' ', names=cols, header=None,
	na_values='.', parse_dates=[[1,2,3]])
	cols = [c.strip().rstrip('.').replace(' ', '_').replace('.', '_')
	for c in nino.columns]
	nino.columns = cols
	nino.date = pd.to_datetime(nino.date, format='%y%m%d')
	nino['zon_winds_mph'] = nino.zon_winds * 2.237

mattharrison / README.txt

Created June 1, 2018 13:49