Working with messy data
ylebras/openrefine-docker
3333
$__history_id__
$__galaxy_url__
8080
$__galaxy_url__
&1)
while [[ \${STATUS} =~ "refused" ]]
do
    echo "Waiting for openrefine: \$STATUS"
    STATUS=\$(curl --include 'http://127.0.0.1:3333' 2>&1)
    sleep 4
done
]]>
OpenRefine (previously Google Refine) is a powerful tool for working with messy data:
cleaning it, transforming it from one format into another, and extending it with web services and external data.

.. class:: infomark

Example input file (TAB separated)::

    "name"                               "longitude"    "latitude"    "countryCode"
    Accipiter striatus Vieillot, 1808    -60.291838     46.328137     CA
    Accipiter striatus Vieillot, 1808    -114.58927     35.022485     US
    Accipiter striatus Vieillot, 1808    -93.37406      30.00586      US
    Accipiter striatus Vieillot, 1808    -79.336288     43.682218     CA
    Accipiter striatus Vieillot, 1808    -109.156024    31.904185     US
    Accipiter striatus Vieillot, 1808    -71.098031     42.297408     US
    Accipiter striatus Vieillot, 1808    -110.927215    32.18203      US

]]>