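Overview of the base puppet environment:
The puppet server and the clients all have the external EPEL yum repository enabled, and puppet was installed on each of them through yum. That process is straightforward, so it is not covered here. All machines sit on the same LAN under the cn7788.com domain, with an internal DNS server; LDAP is not used as a domain controller. The LAN also has other clients, but they do not need puppet and are not described.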
    server.cn7788.com 192.168.1.124 puppet-master
    client.cn7788.com 192.168.1.125 puppet-client
    lamp.cn7788.com 192.168.1.126 puppet-client
    xen.cn7788.com 192.168.1.144 puppet-client
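Write the name-to-address mappings above into /etc/hosts on each machine. It is also advisable to synchronize the clock on every puppet client with ntpdate, because puppet's certificate verification is strict about time; otherwise the puppet client reports errors like the following when it connects: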
    warning: peer certificate won't be verified in this SSL session
    info: Caching certificate for client.cn7788.com
    info: Caching certificate_revocation_list for ca
    err: Could not retrieve catalog from remote server: certificate verify failed. This is often because the time is out of sync on the server or client
    warning: Not using cache on failed catalog
    err: Could not retrieve catalog; skipping run
    err: Could not send report: certificate verify failed. This is often because the time is out of sync on the server or client
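The two preparation steps (hosts entries and clock sync) can be sketched in shell as follows. This is only an illustration: ensure_host_entry and sync_clock are hypothetical helper names, and pool.ntp.org is an assumed NTP server that should be replaced with whatever time source the LAN actually uses.

```shell
#!/bin/bash
# Sketch only: helpers for the preparation steps described above.
# ensure_host_entry and sync_clock are hypothetical names, not puppet commands.

# Append "IP hostname" to a hosts file unless the hostname is already listed.
ensure_host_entry() {
    local ip="$1" name="$2" hosts_file="${3:-/etc/hosts}"
    # -F: fixed string, -w: whole-word match, -q: quiet
    if ! grep -qwF -- "$name" "$hosts_file" 2>/dev/null; then
        printf '%s %s\n' "$ip" "$name" >> "$hosts_file"
    fi
}

# One-shot clock sync; needs root. The default server is an assumption.
sync_clock() {
    ntpdate "${1:-pool.ntp.org}"
}
```

Run as root on each machine, this would be called once per line of the table above, for example ensure_host_entry 192.168.1.124 server.cn7788.com, followed by sync_clock. Calling the helper twice for the same host leaves a single entry in the file.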
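The requirement is this: the client machines xen.cn7788.com and lamp.cn7788.com do not have the nagios client installed, and we want the puppet server to push a shell script that installs it automatically. The other clients do not need this for now. How can it be done?
Because there are many client nodes, this calls for puppet's node and module concepts. First create a module named nagioscli: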
    mkdir -p /etc/puppet/modules/nagioscli/{manifests,files,templates}
The file nagioscli.sh in the module's files directory has the following content:
    #!/bin/bash
    useradd nagios
    cd /usr/local/src
    wget http://syslab.comsenz.com/downloads/linux/nagios-plugins-1.4.13.tar.gz
    wget http://syslab.comsenz.com/downloads/linux/nrpe-2.12.tar.gz
    tar zxvf nagios-plugins-1.4.13.tar.gz
    cd nagios-plugins-1.4.13
    ./configure
    make
    make install
    chown nagios:nagios /usr/local/nagios
    chown -R nagios:nagios /usr/local/nagios/libexec
    cd ../
    tar zxvf nrpe-2.12.tar.gz
    cd nrpe-2.12
    ./configure
    make all
    make install-plugin
    make install-daemon
    make install-daemon-config
    sed -i 's@allowed_hosts=127.0.0.1@allowed_hosts=114.112.11.11@' /usr/local/nagios/etc/nrpe.cfg
    # 114.112.11.11 is the IP address of the nagios server; change it to suit your environment.
    /usr/local/nagios/bin/nrpe -c /usr/local/nagios/etc/nrpe.cfg -d
    echo "/usr/local/nagios/bin/nrpe -c /usr/local/nagios/etc/nrpe.cfg -d" >> /etc/rc.local
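If the script is ever run more than once on the same machine, it rebuilds everything and appends a second copy of the start command to /etc/rc.local. The following is a minimal sketch of two guards that could avoid this; nrpe_missing and append_once are hypothetical helper names and are not part of the original script.

```shell
#!/bin/bash
# Sketch only: helpers that would make the install script safe to re-run.
# Both function names are hypothetical.

# Succeeds (exit 0) while NRPE still has to be installed, i.e. the binary is absent.
nrpe_missing() {
    local nrpe_bin="${1:-/usr/local/nagios/bin/nrpe}"
    [ ! -x "$nrpe_bin" ]
}

# Append a line to a file only if that exact line is not in it yet.
append_once() {
    local line="$1" file="$2"
    # -x: match the whole line, -F: fixed string
    grep -qxF -- "$line" "$file" 2>/dev/null || printf '%s\n' "$line" >> "$file"
}
```

With these in place, the script could start with nrpe_missing || exit 0 and use append_once with the start command and /etc/rc.local in place of the final echo.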
The content of site.pp is as follows:
    import "node.pp"
site.pp is extended here to load node.pp, so puppet-master automatically loads and processes node.pp when it starts.
The content of node.pp is as follows:
    node 'lamp.cn7788.com' {
        file { "/usr/local/src/nagioscli.sh":
            source => "puppet://server.cn7788.com/modules/nagioscli/nagioscli.sh",
            group => root,
            owner => root,
            mode => 755,
        }
        exec { "auto install naigios client":
            command => "sh /usr/local/src/nagioscli.sh",
            user => "root",
            path => ["/usr/bin", "/usr/sbin", "/bin", "/bin/sh"],
        }
    }

    node 'xen.cn7788.com' {
        file { "/usr/local/src/nagioscli.sh":
            source => "puppet://server.cn7788.com/modules/nagioscli/nagioscli.sh",
            group => root,
            owner => root,
            mode => 644,
        }
        exec { "auto install naigios client":
            command => "sh /usr/local/src/nagioscli.sh",
            user => "root",
            path => ["/usr/bin", "/usr/sbin", "/bin", "/bin/sh"],
        }
    }

    node 'client.cn7788.com' {
    }
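The node block for client.cn7788.com is empty, which means no action is taken on that machine. The client machine is also part of the puppet environment and is configured to connect automatically, so the empty definition keeps puppet from reporting errors every time it connects.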
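Since the lamp and xen node blocks are nearly identical, they could also be generated rather than typed by hand. The sketch below prints one node block per host name. render_node and render_nodes are hypothetical helpers, and the require and creates parameters are additions that the node.pp above does not contain: they are meant to make the exec wait for the file and skip the install once the nrpe binary exists.

```shell
#!/bin/bash
# Sketch only: print a puppet node block for each host name given.
# render_node/render_nodes are hypothetical; "require" and "creates"
# are not in the original node.pp.

render_node() {
    local host="$1"
    cat <<EOF
node '$host' {
    file { "/usr/local/src/nagioscli.sh":
        source => "puppet://server.cn7788.com/modules/nagioscli/nagioscli.sh",
        group => root,
        owner => root,
        mode => 755,
    }
    exec { "auto install naigios client":
        command => "sh /usr/local/src/nagioscli.sh",
        user => "root",
        path => ["/usr/bin", "/usr/sbin", "/bin"],
        require => File["/usr/local/src/nagioscli.sh"],
        creates => "/usr/local/nagios/bin/nrpe",
    }
}
EOF
}

render_nodes() {
    local host
    for host in "$@"; do
        render_node "$host"
    done
}
```

Calling render_nodes lamp.cn7788.com xen.cn7788.com and redirecting the output to a file yields the two install nodes; the empty client.cn7788.com block would still have to be appended by hand.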
Taking xen.cn7788.com as the example, run the following command on that host:
    puppetd --test --server server.cn7788.com
The command output on xen.cn7788.com is as follows:
    info: Caching catalog for xen.cn7788.com
    info: Applying configuration version '1382622383'
    --- /usr/local/src/nagioscli.sh 2013-10-24 22:35:36.000000000 +0800
    +++ /tmp/puppet-file.22857.0 2013-10-24 22:39:08.000000000 +0800
    @@ -1,4 +1,5 @@
     #!/bin/bash
    +yum -y install httpd gcc gcc-c++ glibc glibc-common gd gd-devel
     useradd nagios
     cd /usr/local/src
     wget wget http://syslab.comsenz.com/downloads/linux/nagios-plugins-1.4.13.tar.gz
    info: FileBucket adding {md5}f75e9aa3fc301c8e9c85f2677feaa9b5
    info: /Stage[main]//Node[xen.cn7788.com]/File[/usr/local/src/nagioscli.sh]: Filebucketed /usr/local/src/nagioscli.sh to puppet with sum f75e9aa3fc301c8e9c85f2677feaa9b5
    notice: /Stage[main]//Node[xen.cn7788.com]/File[/usr/local/src/nagioscli.sh]/content: content changed '{md5}f75e9aa3fc301c8e9c85f2677feaa9b5' to '{md5}a1ed4dc2b98450e3144530f32677f736'
    notice: /Stage[main]//Node[xen.cn7788.com]/Exec[auto install naigios client]/returns: executed successfully
    notice: Finished catalog run in 283.11 seconds
The run took a long time, 283.11 seconds in total. Next, check whether the nrpe process has been started on the xen.cn7788.com node with the following command:
    ps aux | grep nrpe | grep -v grep
The command output is as follows:
    nagios 22331 0.0 0.1 5108 924 ? Ss 22:35 0:00 /usr/local/nagios/bin/nrpe -c /usr/local/nagios/etc/nrpe.cfg -d
Now check /etc/rc.local to see whether the start command was added to it:
    grep -v "^#" /etc/rc.local
The command output is as follows:
    touch /var/lock/subsys/local
    /usr/local/nagios/bin/nrpe -c /usr/local/nagios/etc/nrpe.cfg -d
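When the same verification has to be repeated on several nodes, the two manual checks can be wrapped in small functions. This is a sketch under the assumption that the daemon lives at the path used by the install script; nrpe_running and starts_at_boot are hypothetical names.

```shell
#!/bin/bash
# Sketch only: the two manual checks above as reusable functions.
# Function names are hypothetical.

# True when a process whose command line contains "nrpe" is running.
# The [n] bracket keeps grep from matching its own command line.
nrpe_running() {
    ps aux | grep '[n]rpe' > /dev/null
}

# True when a non-comment line of the given file starts the NRPE daemon.
starts_at_boot() {
    local rc_file="${1:-/etc/rc.local}"
    grep -v '^#' "$rc_file" | grep -qF '/usr/local/nagios/bin/nrpe'
}
```

On a node, nrpe_running && starts_at_boot exits with status 0 only when both checks pass, which makes it easy to use from another script.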
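The checks show that the nagioscli module on puppet-master works correctly. The result on lamp.cn7788.com is similar, so its check output is not repeated here. What is worth looking at is how long the run took on lamp.cn7788.com; the command is:
    puppetd --test --server server.cn7788.com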
The result is as follows:
    info: Caching catalog for lamp.cn7788.com
    info: Applying configuration version '1382622383'
    notice: /Stage[main]//Node[lamp.cn7788.com]/Exec[auto install naigios client]/returns: executed successfully
    notice: Finished catalog run in 169.08 seconds
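This run is also fairly slow, 169.08 seconds in total.
Pushing a script for execution like this is a common need in day-to-day work: running server tuning commands on nodes with differing names, purging varnish caches in bulk, or pushing files according to machine name. With minor adaptation, this example can be put to use directly.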