Research
- Crime Categories
- Murder Circumstances
- Charges
- Murder Numbers by SHR
- Definitions of Murder
- Crime Literature
- Other Literature
- Seminars
- Journal Ranking
- Laws
- Changes in Law and Reporting in Michigan
- Citation Guides
- Datasets
Writing
Methods
- BLP
- Econometrics Models
- Econometrics Tests
- Econometrics Resources
- Event Study Plots
- Metrics Literature
- Machine Learning
Python-related
- Python Basic Commands
- Pandas Imports and Exports
- Pandas Basic Commands
- Plotting in Python
- Python web scraping sample page
- Two Sample t Test in Python
- Modeling in Python
R-related
- R Basics
- R Statistics Basics
- RStudio Basics
- R Graphics
- R Programming
- Accessing MySQL Databases from R
Latex-related
Stata-related
SQL
Github
Linux-related
Conda-related
AWS-related
Webscraping
Interview Prep
Other
How to Import to and Export from Python Pandas
The most common use of Python Pandas is to process data. However, data could be recorded in different formats: tab-delimited text data, .csv files, .json files, .dta files, etc. Below I would like to list some sources and offer some examples of how to import certain types of data into Python Pandas and how to export them from it. Please find the Table of Contents below.
Table of Contents
Import
Fixed Width
Find link here. The example they have is:
file_name = r"/tmp/file.txt"
fwidths = [11, 11, 11, 11, 11, 11]
df = pd.read_fwf(file_name, widths = fwidths,
names = ['col0', 'col1', 'col2', 'col3', 'col4', 'col5'])
Excel File
Find link here. I have combined two of their examples:
pd.read_excel('tmp.xlsx', index_col = 0, sheet_name = 'Sheet3')
Stata File
Find link here.
The pandas method read_stata
is not able to import value labels. In cases when we would like to import value labels, we can make use of the library pyreadstat
(Github repo).
df_circum, meta = pyreadstat.read_dta('../Data/1988/y1988.dta', apply_value_formats=True)
Tab Delimited File
Find link here. Example given on the page:
df = pd.read_csv('file_location\filename.txt', delimiter = '\t')
Clipboard
df = pd.read_clipboard()
R
# read in the R file
file = pyreadr.read_r("nibrs_1991_2020_offense_segment_rds/nibrs_offense_segment_1998.rds")
dfv98 = file[None]
dfv98
Export
Stata
Official documentation here. One example is:
df.to_stata('state_expenditures.dta', write_index=False)
LaTeX
df.style.hide(axis='index').to_latex('../../mean_crime.tex',position="h!",position_float="centering",hrules=True,label="tbl:crimedate",caption="Month When Maximum is Reached")