2. 线程控制

2.1. 创建线程

#include <pthread.h>

int pthread_create(pthread_t *restrict thread,
                   const pthread_attr_t *restrict attr,
                   void *(*start_routine)(void *), void *restrict arg);

返回值：成功返回 0，失败返回错误号。以前学过的系统函数都是成功返回 0，失败返回 -1，而错误号保存在全局变量 errno 中，而 pthread 库的函数都是通过返回值返回错误号，虽然每个线程也都有一个 errno，但这是为了兼容其它函数接口而提供的，pthread 库本身并不使用它，通过返回值返回错误码更加清晰。

在一个线程中调用 pthread_create() 创建新的线程后，当前线程从 pthread_create() 返回继续往下执行，而新的线程所执行的代码由我们传给 pthread_create 的函数指针 start_routine 决定。start_routine 函数接收一个参数，是通过 pthread_create 的 arg 参数传递给它的，该参数的类型为 void *，这个指针按什么类型解释由调用者自己定义。start_routine 的返回值类型也是 void *，这个指针的含义同样由调用者自己定义。start_routine 返回时，这个线程就退出了，其它线程可以调用 pthread_join 得到 start_routine 的返回值，类似于父进程调用 wait(2) 得到子进程的退出状态，稍后详细介绍 pthread_join。

pthread_create 成功返回后，新创建的线程的 id 被填写到 thread 参数所指向的内存单元。我们知道进程 id 的类型是 pid_t，每个进程的 id 在整个系统中是唯一的，调用 getpid(2) 可以获得当前进程的 id，是一个正整数值。线程 id 的类型是 thread_t，它只在当前进程中保证是唯一的，在不同的系统中 thread_t 这个类型有不同的实现，它可能是一个整数值，也可能是一个结构体，也可能是一个地址，所以不能简单地当成整数用 printf 打印，调用 pthread_self(3) 可以获得当前线程的 id。

attr 参数表示线程属性，本章不深入讨论线程属性，所有代码例子都传 NULL 给 attr 参数，表示线程属性取缺省值，感兴趣的读者可以参考 APUE2e。首先看一个简单的例子：

#include <pthread.h>
#include <stdio.h>
#include <stdlib.h>
#include <string.h>
#include <unistd.h>

pthread_t ntid;

void printids(const char *s) {
    pid_t pid;
    pthread_t tid;

    pid = getpid();
    tid = pthread_self();
    printf("%s pid %u tid %u (0x%x)\n", s, (unsigned int)pid, (unsigned int)tid,
           (unsigned int)tid);
}

void *thr_fn(void *arg) {
    printids(arg);
    return NULL;
}

int main(void) {
    int err;

    err = pthread_create(&ntid, NULL, thr_fn, "new thread: ");
    if (err != 0) {
        fprintf(stderr, "can't create thread: %s\n", strerror(err));
        exit(1);
    }
    printids("main thread:");
    sleep(1);

    return 0;
}

编译运行结果如下：

bash

$ gcc main.c -lpthread
$ ./a.out
main thread: pid 7398 tid 3084450496 (0xb7d8fac0)
new thread:  pid 7398 tid 3084446608 (0xb7d8eb90)

可知在 Linux 上， thread_t 类型是一个地址值，属于同一进程的多个线程调用 getpid(2) 可以得到相同的进程号，而调用 pthread_self(3) 得到的线程号各不相同。

由于 pthread_create 的错误码不保存在 errno 中，因此不能直接用 perror(3) 打印错误信息，可以先用 strerror(3) 把错误码转换成错误信息再打印。

如果任意一个线程调用了 exit 或 _exit ，则整个进程的所有线程都终止，由于从 main 函数 return 也相当于调用 exit ，为了防止新创建的线程还没有得到执行就终止，我们在 main 函数 return 之前延时 1 秒，这只是一种权宜之计，即使主线程等待 1 秒，内核也不一定会调度新创建的线程执行，下一节我们会看到更好的办法。

思考题：主线程在一个全局变量 ntid 中保存了新创建的线程的 id，如果新创建的线程不调用 pthread_self 而是直接打印这个 ntid ，能不能达到同样的效果？

2.2. 终止线程

如果需要只终止某个线程而不终止整个进程，可以有三种方法：

从线程函数 return 。这种方法对主线程不适用，从 main 函数 return 相当于调用 exit 。
一个线程可以调用 pthread_cancel 终止同一进程中的另一个线程。
线程可以调用 pthread_exit 终止自己。

用 pthread_cancel 终止一个线程分同步和异步两种情况，比较复杂，本章不打算详细介绍，读者可以参考 APUE2e。下面介绍 pthread_exit 的和 pthread_join 的用法。

#include <pthread.h>

void pthread_exit(void *value_ptr);

value_ptr 是 void * 类型，和线程函数返回值的用法一样，其它线程可以调用 pthread_join 获得这个指针。

需要注意，pthread_exit 或者 return 返回的指针所指向的内存单元必须是全局的或者是用 malloc 分配的，不能在线程函数的栈上分配，因为当其它线程得到这个返回指针时线程函数已经退出了。

#include <pthread.h>

int pthread_join(pthread_t thread, void **value_ptr);

返回值：成功返回 0，失败返回错误号

调用该函数的线程将挂起等待，直到 id 为 thread 的线程终止。 thread 线程以不同的方法终止，通过 pthread_join 得到的终止状态是不同的，总结如下：

如果 thread 线程通过 return 返回，value_ptr 所指向的单元里存放的是 thread 线程函数的返回值。
如果 thread 线程被别的线程调用 pthread_cancel 异常终止掉，value_ptr 所指向的单元里存放的是常数 PTHREAD_CANCELED。
如果 thread 线程是自己调用 pthread_exit 终止的，value_ptr 所指向的单元存放的是传给 pthread_exit 的参数。

如果对 thread 线程的终止状态不感兴趣，可以传 NULL 给 value_ptr 参数。

看下面的例子（省略了出错处理）：

#include <pthread.h>
#include <stdio.h>
#include <stdlib.h>
#include <unistd.h>

void *thr_fn1(void *arg) {
    printf("thread 1 returning\n");
    return (void *)1;
}

void *thr_fn2(void *arg) {
    printf("thread 2 exiting\n");
    pthread_exit((void *)2);
}

void *thr_fn3(void *arg) {
    while (1) {
        printf("thread 3 writing\n");
        sleep(1);
    }
}

int main(void) {
    pthread_t tid;
    void *tret;

    pthread_create(&tid, NULL, thr_fn1, NULL);
    pthread_join(tid, &tret);
    printf("thread 1 exit code %d\n", (int)tret);

    pthread_create(&tid, NULL, thr_fn2, NULL);
    pthread_join(tid, &tret);
    printf("thread 2 exit code %d\n", (int)tret);

    pthread_create(&tid, NULL, thr_fn3, NULL);
    sleep(3);
    pthread_cancel(tid);
    pthread_join(tid, &tret);
    printf("thread 3 exit code %d\n", (int)tret);

    return 0;
}

运行结果是：

bash

$ ./a.out
thread 1 returning
thread 1 exit code 1
thread 2 exiting
thread 2 exit code 2
thread 3 writing
thread 3 writing
thread 3 writing
thread 3 exit code -1

可见在 Linux 的 pthread 库中常数 PTHREAD_CANCELED 的值是 -1。可以在头文件 pthread.h 中找到它的定义：

#define PTHREAD_CANCELED ((void *) -1)

一般情况下，线程终止后，其终止状态一直保留到其它线程调用 pthread_join 获取它的状态为止。但是线程也可以被置为 detach 状态，这样的线程一旦终止就立刻回收它占用的所有资源，而不保留终止状态。不能对一个已经处于 detach 状态的线程调用 pthread_join ，这样的调用将返回 EINVAL 。对一个尚未 detach 的线程调用 pthread_join 或 pthread_detach 都可以把该线程置为 detach 状态，也就是说，不能对同一线程调用两次 pthread_join ，或者如果已经对一个线程调用了 pthread_detach 就不能再调用 pthread_join 了。

#include <pthread.h>

int pthread_detach(pthread_t tid);

返回值：成功返回 0，失败返回错误号。

第 1 章程序的基本概念

第 2 章常量、变量和表达式

第 3 章简单函数

第 4 章分支语句

第 5 章深入理解函数

第 6 章循环语句

第 7 章结构体

第 8 章数组

第 9 章编码风格

第 10 章 gdb

第 11 章排序与查找

第 12 章栈与队列

第 14 章计算机中数的表示

第 15 章数据类型详解

第 16 章运算符详解

第 17 章计算机体系结构基础

第 18 章 x86 汇编程序基础

第 19 章汇编与 C 之间的关系

第 20 章链接详解

第 21 章预处理

第 22 章 Makefile 基础

第 23 章指针

第 24 章函数接口

第 25 章 C 标准库

第 26 章链表、二叉树和哈希表

第 28 章文件与 I/O

第 29 章文件系统

第 30 章进程

第 31 章 Shell 脚本

第 32 章正则表达式

第 33 章信号

第 34 章终端、作业控制与守护进程

第 35 章线程

第 36 章 TCP/IP 协议基础

第 37 章 socket 编程

2. 线程控制

2.1. 创建线程

2.2. 终止线程

2. 线程控制 ​

2.1. 创建线程 ​

2.2. 终止线程 ​

2. 线程控制

2.1. 创建线程

2.2. 终止线程