Remove junk from a file

2023-01-23 19:58

I have a CSV file with some junk at the beginning of the file. How do I get rid of it?

sh-3.2# more data_combined.csv
ï»¿84252,1,A ROSEAL

The file should start with the number 842...

For the data shown, this should do the trick (assuming a single-byte codeset such as ISO 8859-1, and not UTF-8, for example):

sed '1s/^...//' data_combined.csv

If it is UTF-8, then there are 6 bytes of garbage at the start. If sed is run in a UTF-8 locale, the '.' metacharacter matches a whole UTF-8 character (2 bytes each in the case shown), so the same expression works fine. If sed is run with an SBCS (single-byte code set) such as 8859-1, then you'd need to use a pattern like:

sed '1s/^.\{6\}//' data_combined.csv

Actually, writing 6 dots would take just as many characters, but the counted form is perhaps the clearer generalization.
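Which of the two patterns applies depends on how many junk bytes are really in the file, and that is easy to check before editing anything. The sketch below is only an illustration: it builds a hypothetical sample file on the assumption that the junk is a UTF-8 byte-order mark (bytes EF BB BF, which display as the three characters in the question when viewed as ISO 8859-1), then dumps the leading bytes in hex. The temporary directory and the `leading` variable are names made up for the example.

```shell
# Hypothetical sample reproducing the question, ASSUMING the junk is a
# UTF-8 byte-order mark (bytes EF BB BF, written here as octal 357 273 277).
tmpdir=$(mktemp -d)
sample="$tmpdir/data_combined.csv"
printf '\357\273\27784252,1,A ROSEAL\n' > "$sample"

# Dump the first 8 bytes as hex and squeeze out od's spacing,
# so the junk bytes in front of the first digit can be counted.
leading=$(head -c 8 "$sample" | od -An -tx1 | tr -d ' \n')
echo "$leading"
```

On your real file, run the `head -c 8 ... | od -An -tx1` part directly. Three bytes before the first digit (`38` is the digit 8) would point to a plain byte-order mark; six would match the UTF-8 case described above.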

As Dennis Williamson correctly said in the all too brief interval while I slept, to remove non-digits from the start of the first line, use:

sed '1s/^[^0-9]*//' data_combined.csv
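Here is a small, hedged demonstration of that last command. The sample file is hypothetical (the second row is invented, and the junk is assumed to be a UTF-8 byte-order mark), and `LC_ALL=C` is my addition so that sed treats the input as plain bytes whatever the terminal's locale is. The result goes to a new file, `clean.csv`, rather than editing in place.

```shell
# Hypothetical two-line sample; the leading bytes are ASSUMED to be a
# UTF-8 byte-order mark (octal 357 273 277) and the second row is made up.
tmpdir=$(mktemp -d)
sample="$tmpdir/data_combined.csv"
printf '\357\273\27784252,1,A ROSEAL\n84253,2,B EXAMPLE\n' > "$sample"

# Strip every non-digit byte from the start of line 1 only.
# LC_ALL=C makes sed work byte by byte, independent of the file's encoding.
LC_ALL=C sed '1s/^[^0-9]*//' "$sample" > "$tmpdir/clean.csv"

# Show the repaired first line; later lines are passed through untouched.
first_line=$(head -n 1 "$tmpdir/clean.csv")
echo "$first_line"
```

This approach does not care how many junk bytes there are, but it only makes sense when the first field of the first line is known to start with a digit, as it does here.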
