项目作者: UTHSCSA-CIRD

项目描述 :
Wrapper for a couple of address parsing/standardizing libraries for the purpose of cleaning up addresses in CSV files.
高级语言: Python
项目地址: git://github.com/UTHSCSA-CIRD/addressfix.git
创建时间: 2017-08-12T19:30:42Z
项目社区:https://github.com/UTHSCSA-CIRD/addressfix

开源协议:MIT License

下载


addressfix

Wrapper for a couple of address parsing/standardizing libraries for the purpose of cleaning up addresses in CSV files.

Instructions:

Setup:

  1. pip install -r requirements.txt

Run with default settings (won’t work right for this input file):

  1. python addressfix.py -f test_address_input.csv

Run keeping columns 0,2,3,4 in the output and using column 4 as the address:

  1. python addressfix.py -a 4 -k 0,2,3,4 -f test_address_input.csv
  2. NOTE: Python counts everything from 0!! So -a 4 means the FIFTH column
  3. the way most non-programmers would count them in a spreadsheet, and the
  4. 0,2,3,4 would likewise be first and the third through fifth columns!

Get help

  1. python addressfix.py -h