files indexed by production date

2022-12-29 22:01 问答作者：

Each day an application creates a file called file_YYYYMMDD.csv where YYYYMMDD is the production date. Bu开发者_高级运维t sometimes the generation fails and no files are generated for a couple of days.

I'd like an easy way in a bash or sh script to find the filename of the most recent file, which has been produced before a given reference date.

Typical usage: find the last generated file, disregarding those produced after the May 1st.

Thanks for your help

This script avoids:

Using sed repeatedly in a loop
Parsing ls
Creating a subshell in the while loop
Processing files that don't match the file_*.csv name pattern

Here's the script:

#!/bin/bash
while read -r file
do
    date=${file#*_}    # strip off everything up to and including the underscore
    date=${date%.*}    # strip off the dot and everything after
    if [[ $date < $1 ]]
    then
        break
    fi
done < <(find -name "file_*.csv" | sort -r)

# do something with $file, such as:
echo "$file"

Edit:

With Bash >= 3.2, you can do this using a regular expression:

#!/bin/bash
regex='file_([[:digit:]]+).csv'
while read -r file
do
    [[ $file =~ $regex ]]
    date=${BASH_REMATCH[1]}
    if [[ $date < $1 ]]
    then
        break
    fi
done < <(find -name "file_*.csv" | sort -r)

# do something with $file, such as:
echo "$file"

Try this:

#!/bin/bash

ls -r | while read fn; do
    date=`echo $fn | sed -e 's/^file_\([0-9]*\)\.csv$/\1/'` || continue
    if [ $date -lt $1 ]; then
        echo $fn
        exit
    fi
done

Just call this script with the reference date you want to compare with. Replace -lt with -le if you want to include the reference date.

Edit: An alternate solution, without piping an echoed variable. Note that I didn't test it, but it should work, too.

#!/bin/bash

ls -r | sed -e 's/^file_\([0-9]*\)\.csv$/\1/' | while read date; do
    if [ $date -lt $1 ]; then
        echo "file_${date}.csv"
        exit
    fi
done

Sorting file names with man 1 sort will fail if there's a newline character in a file name.

Instead we should use something like:

touch $'filename\nwith\777pesky\177chars.txt'  # create a test file

ls -1db * 

find ... -print0 | LC_ALL=C sort0 ...

see:

Find all used extensions in subdirectories,

http://codesnippets.joyent.com/posts/show/2300

继续阅读：bash date file shell

files indexed by production date

更多精彩内容

精彩评论

最新问答

央视是哪个频道？

请问买过的朋友，舒提啦旅行箱实际使用体验如何？？

检查不孕不育需要的费用？

海信ULED电视画质有什么不同的地方?？

钉子可以挂的住画框幕布吗？

问答排行榜

河神2九牛入海钓河妖是第几集河妖什么来历可活吞牛？

性激素六项检查的最佳时间是多久？多少钱？？

Easiest way to get words of one line from istream into a vector?

《梦在燃烧 (《三国演义》动画片主题曲)》MP3歌词-汤子星？

抽烟只抽炫赫门？