How to count how many time a word occurs in an xml file

2022-12-20 00:43 问答作者：

I was given the task of doing quality check on a machine translation xml file. The translations are from English to a foreign language. I have about 2000 translation blocks in the file and I have to check 200 of them by adding my remarks in the block enclosed in a < comment > tag with a quality attribute. Is there a linux com开发者_如何转开发mand or some text editor out there which can count the number of comment tags I add or just the number of time the word '/comment' occurs so I don't have to keep track manually?

grep '/comment' yourfile.xml -o | wc -l

This XSLT stylesheet can be run on any platform and will tell you how many comment elements there are in the XML document:

<?xml version="1.0" encoding="UTF-8"?>
  <xsl:stylesheet
   version="1.0"
   xmlns:xsl="http://www.w3.org/1999/XSL/Transform">
  <xsl:output method="text" encoding="UTF-8" omit-xml-declaration="yes"/>
  <xsl:template match="/">
    <xsl:value-of select="count(//comment)"/>
  </xsl:template>
</xsl:stylesheet>

If you add a XSLT processing instruction at the top of the XML file that points to this XSLT( e.g. <?xml-stylesheet href="countComments.xsl" type="text/xsl"?> ), then you could just load the XML file in a browser and see the number displayed.

If you know that the </comment> doesn't occur more than once per line, just use grep -c "</comment>". Example:

[~/.logs]> grep -c ldap johnf.2010-02-12.log
103

This searches for the string ldap in the file johnf.2010-02-12.log. The string appears on 103 distinct lines.

As long as the comments appear on their own line, you could try

cat file | grep -c comment

The -c stands for 'count'.

your tag says linux, so i assume you have *nix tools like awk

awk '{for(i=1;i<=NF;i++){if($i=="/comment"){++c} } }END{print "total: "c}' xmlfile

继续阅读：text text-editor xml

How to count how many time a word occurs in an xml file

更多精彩内容

精彩评论

最新问答

央视是哪个频道？

请问买过的朋友，舒提啦旅行箱实际使用体验如何？？

检查不孕不育需要的费用？

海信ULED电视画质有什么不同的地方?？

钉子可以挂的住画框幕布吗？

问答排行榜

河神2九牛入海钓河妖是第几集河妖什么来历可活吞牛？

性激素六项检查的最佳时间是多久？多少钱？？

Easiest way to get words of one line from istream into a vector?

《梦在燃烧 (《三国演义》动画片主题曲)》MP3歌词-汤子星？

抽烟只抽炫赫门？