MIT Missing Semester 2: Shell Tools and Scripting¶

这节课主要介绍了写脚本时用到的相关命令，以及给出了一些简单脚本示例。

Lecture overview¶

保留关键字 reserved word¶

在写脚本的时候，形如$1指一条命令的第一个参数，在写脚本的时候我们可以利用它化简脚本中重复出现的内容。

$# 是命令的参数数量。

$$ 是运行该命令的进程ID。

$@ 是展开命令中的所有参数。

$_ 指上一次命令中的最后一个参数。

$? 指上一次命令执行完成后的错误代码(0表示正确执行没有错误，非0则有错误)。

!! 指上一条命令。

逻辑运算符¶

A || B 当前者错误运行后才会执行B，否则只执行A。

A && B 当A正确运行后才会执行B，否则只执行A。

A ; B 执行完A再执行B，没有约束。

替换 Substitution¶

命令替换¶

foo=$(pwd) 会将pwd得到的输出(也就是当前路径)存储到变量foo中。至于$(pwd)，相当于创建了子shell执行命令pwd。

echo "we are at $(pwd)" (注意是双引号)会输出we are at [当前路径]。

进程替换 process substitution¶

cat <(ls) <(ls ..) 的作用是将上一层目录的内容存在一个临时文件中，将当前目录的内容存在一个临时文件中，然后将file handle交给cat命令，拼接后输出。所以得到的内容是当前目录和上一级目录的内容。

通配符 Wildcard¶

ls *.sh 查找后缀为.sh的文件

ls project? "?"只展开一个字符进行搜索

convert image.png image.jpg可以写成convert image.{png,jpg}

touch {foo,bar}/{a..j} 可以进行一个笛卡尔积式的展开

find . -name src -type d 这条指令的意思是在当前目录下使用find，找名字为src，type为目录(directory)的路径；find . -path "**/test/*.py -type f"即使在当前目录下使用find，找路径是xxx的python文件。

fd命令用正则表达式来搜索相关内容。

Shellcheck 分析脚本语法错误¶

shellcheck可以帮助分析shell脚本中的语法错误等。

目录结构¶

显示目录结构可以使用tree, broot等命令。

An example .sh script¶

#!/bin/bash
echo "Starting program at $(date)" # Date will be substituted

echo "Running program $0 with $# arguments with pid $$"

for file in "$@"; do
  grep foobar "$file" ? /dev/null 2> /dev/null
  # When pattern is not found, grep has exit status
  # We redirect STDOUT and STDERR to a null register place

  if [["$?" -ne 0]]; then
    echo "File $file does not have any foobar, adding one"
    echo "# foobar" >> "$file"
  fi
done

The program above is to find the word "foobar" in those files, and if a file don't contain "foobar", then add "# foobar" to the end of the file.

An example .py script¶

#!/usr/bin/env python
import sys
for arg in reversed(sys.argv[1:]):
  print(arg)
# foobar

将参数倒序输出。

Some Trivial Things¶

可以使用tldr去查看一些命令的使用例子，它不像man命令列出了所有文档内容，而是举例形式给出。

Exercises¶

Read man ls and write an ls command that lists files in the following manner
Includes all files, including hidden files
Sizes are listed in human readable format (e.g. 454M instead of 454279954)
Files are ordered by recency
Output is colorized

Sol. For including hidden files, we can use -a. For human readable format-size, we can use -h. For files ordered by recency, use -t. For output is colorized, use --color=auto.

Write bash functions marco and polo that do the following. Whenever you execute marco the current working directory should be saved in some manner, then when you execute polo, no matter what directory you are in, polo should cd you back to the directory where you executed marco. For ease of debugging you can write the code in a file marco.sh and (re)load the definitions to your shell by executing source marco.sh.

Sol. First, we need to use redirection to write down our current path. And in polo, we invoke the file which contain out record of the path and use cd to enter it. Here's the bash code.

#!/bin/bash
marco(){
    echo "$(pwd)" > $HOME/Desktop/MIT-Missing/marco.log
    echo "save pwd $(pwd)"
}

polo(){
    echo "$HOME/Desktop/MIT-Missing/marco.log"
    cd "$(cat "$HOME/Desktop/MIT-Missing/marco.log")"
}

Say you have a command that fails rarely. In order to debug it you need to capture its output but it can be time consuming to get a failure run. Write a bash script that runs the following script until it fails and captures its standard output and error streams to files and prints everything at the end. Bonus points if you can also report how many runs it took for the script to fail.

 #!/usr/bin/env bash

 n=$(( RANDOM % 100 ))

 if [[ n -eq 42 ]]; then
    echo "Something went wrong"
    >&2 echo "The error was using magic numbers"
    exit 1
 fi

 echo "Everything went according to plan"

Sol. We need to write a script to detect when the bugger.sh produce error. So the first step we write down echo bugger.sh >> bug_detect.log in the loops. > bug_detect.log is used for redirecting the output stream.

Moreover, we need to detect whether the bugger's execution in this iteration has finished correctly, that is, return 0. Use $? to get the return value.

Below is the answer.

#!/bin/bash

echo > bug_detect.log

for ((count = 0; ; count ++))
do
    ./buggy.sh >> bug_detect.log
    if [[ $? -ne 0 ]]; then
        echo "The script failed after $count iterations."
        break
    fi
done