I have three master nodes, each with an 80 GB disk. Recently I ran into this problem:
Normal Pulling 52s (x2 over 6m17s) kubelet, 192.168.10.37 pulling image "gcr.io/kubeflow-images-public/tensorflow-serving-1.8gpu:latest"
Warning Evicted 8s (x5 over 4m19s) kubelet, 192.168.10.37 The node was low on reso
I am running a database load process (osm2pgsql), and it failed:
Processing: Node(17404k 148.8k/s) Way(1351k 6.38k/s) Relation(9520 29.94/s)way_done failed: ERROR: could not extend file "base/140667/152463": No space left on device
HINT: Check free disk space.
(7)
Arguments were: 187226311,
At the start of the import, mem reported:
tot
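I foolishly decided to upgrade from 14.04 LTS to 14.10, and then to 15.04.
Since then, my website has been down and the file system has become read-only. I don't know what went wrong, because the upgrade completed successfully.
This is what I have found so far: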
root@lew:/# service apache2 status
apache2.service - LSB: Apache2 web server
Loaded: loaded (/etc/init.d/apache2)
Active: failed (Result: exit-code) since Sun 2015-07-12 08:36:18 EDT; 31min ag
I want to grow an ext4 volume on the host, but I noticed that there is no valid partition table to delete and recreate:
fdisk -u /dev/vdb
/dev/vdb: device contains a valid 'ext4' signature; it is strongly recommended to wipe the device with wipefs(8) if this is unexpected, in order to avoid possible collisions
Device does not contain a recognized partition table.
Create
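Every morning my dovecot service goes down and I have to start it again. This happens every day, so I have been looking for clues as to why it goes down, and I found these errors in /var/log/maillog: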
"failed to write to main log: length=165 result=-1 errno=28 (No space left on device)"
"write failed on panic log: length=122 result=-1 errno=28 (No space left on device)"
I see that the errors appear at the same time as the service crashes, so I
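
For reference, errno=28 in the dovecot messages is ENOSPC on Linux, the same "No space left on device" condition as in the osm2pgsql error. One way to check it is to look at both the free bytes and the free inodes on the file system that holds the files being written. Below is a minimal Python sketch of that check. `space_report` is a hypothetical helper name, and the path passed to it is an assumption; it would have to be replaced with the mount that actually holds the logs or the database.

```python
import errno
import os
import shutil


def space_report(path="/"):
    """Return free-space figures for the file system that holds `path`."""
    usage = shutil.disk_usage(path)  # total, used, free (bytes)
    stats = os.statvfs(path)         # f_files / f_ffree are inode counts
    free_percent = 100.0 * usage.free / usage.total if usage.total else 0.0
    return {
        "free_bytes": usage.free,
        "free_percent": free_percent,
        "free_inodes": stats.f_ffree,
        "total_inodes": stats.f_files,
    }


# Show the symbolic name and message for the "no space" error code.
print(errno.errorcode[errno.ENOSPC], "-", os.strerror(errno.ENOSPC))

# "/" is only a placeholder; pass the directory that is actually filling up.
print(space_report("/"))
```

A file system can report ENOSPC while free bytes remain if it has run out of inodes, which is why the sketch reports both numbers.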