apache pig - Unable to Remove Special Characters In Pig -


i have text file want load onto pig engine, text file have names in in separate rows, , data has errors in it.....special characters....something this:

ja@@$s000on    j@@a%^ke t!!ina mel@ani 

i want remove special characters names using regex ....one way found job in pig , have output as...

jason jake tina melani 

can please tell me regex job in pig. write command unable use regex_extract , regex_extract_all function.
can explain significance of number 1 pass function argument after defining regex.

any highly appreciated.

you can use replace regex solve problem.

input.txt   ja@@$s000on   j@@a%^ke t!!ina mel@ani    pigscript: = load 'input.txt' line;   b = foreach generate replace(line,'([^a-za-z\\s]+)','');   dump b;    output:   (jason)   (jake tina melani)   

Comments

Popular posts from this blog

php - Submit Form Data without Reloading page -

linux - Rails running on virtual machine in Windows -

php - $params->set Array between square bracket -