print 'before: {:.2f} after'.format(1.5555)

before: 1.56 after

print '{1},{0},{1},{2},{0}'.format('pos',777,True)

777,pos,777,True,pos

print '{name},{age}'.format(age=18,name='cutie')

cutie,18

has=['first', 2.00, 'third']
print '1st {0[0]} all: {0} last {0[2]} end'.format(has)

1st first all: ['first', 2.0, 'third'] last third end

print 'start--- {:,} ---end'.format(9876543210)

start--- 9,876,543,210 ---end

print 'start:{:>8}'.format(123)

start:     123

print 'start:{:0>8}'.format(123)

start:00000123

print 'start:{:A>8}'.format(123)

start:AAAAA123

Question	answer	explain
how to get how big in memory a DataFrame object is?	df.info()
what is the best representative of null value in Pandas object?	np.nan	import numpy as np
what is the best way to slice a DataFrame by index?	df.iloc[-5:, 2:]	use iloc method
how to convert a DataFrame (excluding indexes) to a numpy ndarray?	df.values	it is a attribute, can’t be called
what is the most basic way to create a DataFrame?	pd.DataFrame(dict)	pass dictionary to; keys are column names
what is broadcasting?	pd[‘new’]=7	all the values of the new column will be 7
how to change df’s column names, index names?	pd.columns = [‘a’,’b’,…] pd.index = [‘c’,’d’,…]	assign value directly

when read csv, how to specify names of the column	pd.read_csv(path, names=[‘a’,’b’,…..])	instead, pass header=None will prevent pandas using data as column names, but use 0,1,2,3 ….
when read csv, how to let pandas to turn some specific values into NaN?	pd.read_csv(path, na_values = ‘-1’) pdf.read_csv(path, na_values = {‘column3’:[‘ -2’, ‘wtf’,…]})	all the values which is character ‘-1’ will be rendered to NaN
how to parse data in reading csv	pd.read_csv(path, parse_dates = [[0,1,2]])	pandas will parse column 1, 2, 3 into one datetype column
does index of df have a name?	pd.index.name = ‘xxx’	assign a name to the index of df
how to save df to a csv file with other delimiters rather than ‘,’	pd.to_csv(path, sep=’\t’)	save to a csv file which separates data by tab


bias-variance trade-off	gradient descent
Ridge regression		cross-validation	measure of fit + measure of model complexity
Lasso regression	coordinate descent	feature selection	measure of fit + (different) measure of model complexity
Nearest Neighbor Regression & Kernel Regression

concave	hax Max value
convex	has Min value

bokeh 4th: server

bokeh 3rd: high-level charts