apache pig - Unable to Remove Special Characters In Pig -
i have text file want load onto pig engine, text file have names in in separate rows, , data has errors in it.....special characters....something this:
ja@@$s000on j@@a%^ke t!!ina mel@ani
i want remove special characters names using regex ....one way found job in pig , have output as...
jason jake tina melani
can please tell me regex job in pig. write command unable use regex_extract , regex_extract_all function.
can explain significance of number 1 pass function argument after defining regex.
any highly appreciated.
you can use replace regex solve problem.
input.txt ja@@$s000on j@@a%^ke t!!ina mel@ani pigscript: = load 'input.txt' line; b = foreach generate replace(line,'([^a-za-z\\s]+)',''); dump b; output: (jason) (jake tina melani)
Comments
Post a Comment