2013-08-06 27 views
3

我將celeryd和celerybeat設置爲守護進程,他們的工作時間不長。但是有一段時間以後,它不會啓動worker,也不會創建pid文件。celeryd和celerybeat pid文件沒有被創建,工作人員沒有啓動,但輸出說OK

這裏是我的/etc/default/celeryd

# Name of nodes to start 
CELERYD_NODES="w1 w2 w3 w4 w5 w6 w7 w8" 

# Extra arguments to celeryd 
CELERYD_OPTS="--time-limit=300 --concurrency=8" 

# Where to chdir at start. 
CELERYD_CHDIR="/srv/www/web-system/myproject" 

# %n will be replaced with the nodename. 
#CELERYD_LOG_FILE="/var/log/celery/%n.log" 
#CELERYD_PID_FILE="/var/run/celery/%n.pid" 
CELERYD_LOG_FILE="/srv/www/web-system/logs/celery/%n.log" 
CELERYD_PID_FILE="/srv/www/web-system/pids/celery/%n.pid" 

# Log level to use for celeryd. Default is INFO. 
CELERYD_LOG_LEVEL="INFO" 

# How to call "manage.py celeryd_multi" 
CELERYD_MULTI="$CELERYD_CHDIR/manage.py celeryd_multi" 

# How to call "manage.py celeryctl" 
CELERYCTL="$CELERYD_CHDIR/manage.py celeryctl" 

# Workers should run as an unprivileged user. 
#CELERYD_USER="celery" 
#CELERYD_GROUP="celery" 
CELERYD_USER="myuser" 
CELERYD_GROUP="myuser" 

# Name of the projects settings module. 
export DJANGO_SETTINGS_MODULE="myproject.settings" 

和這裏的init.d腳本:

#!/bin/sh -e 
# ============================================ 
# celeryd - Starts the Celery worker daemon. 
# ============================================ 
# 
# :Usage: /etc/init.d/celeryd {start|stop|force-reload|restart|try-restart|status} 
# :Configuration file: /etc/default/celeryd 
# 
# See http://docs.celeryproject.org/en/latest/tutorials/daemonizing.html#generic-init-scripts 


### BEGIN INIT INFO 
# Provides:   celeryd 
# Required-Start: $network $local_fs $remote_fs 
# Required-Stop:  $network $local_fs $remote_fs 
# Default-Start:  2 3 4 5 
# Default-Stop:  0 1 6 
# Short-Description: celery task worker daemon 
### END INIT INFO 

# some commands work asyncronously, so we'll wait this many seconds 
SLEEP_SECONDS=5 

DEFAULT_PID_FILE="/var/run/celery/%n.pid" 
DEFAULT_LOG_FILE="/var/log/celery/%n.log" 
DEFAULT_LOG_LEVEL="INFO" 
DEFAULT_NODES="celery" 
DEFAULT_CELERYD="-m celery.bin.celeryd_detach" 

CELERY_DEFAULTS=${CELERY_DEFAULTS:-"/etc/default/celeryd"} 

test -f "$CELERY_DEFAULTS" && . "$CELERY_DEFAULTS" 

# Set CELERY_CREATE_DIRS to always create log/pid dirs. 
CELERY_CREATE_DIRS=${CELERY_CREATE_DIRS:-0} 
CELERY_CREATE_RUNDIR=$CELERY_CREATE_DIRS 
CELERY_CREATE_LOGDIR=$CELERY_CREATE_DIRS 
if [ -z "$CELERYD_PID_FILE" ]; then 
    CELERYD_PID_FILE="$DEFAULT_PID_FILE" 
    CELERY_CREATE_RUNDIR=1 
fi 
if [ -z "$CELERYD_LOG_FILE" ]; then 
    CELERYD_LOG_FILE="$DEFAULT_LOG_FILE" 
    CELERY_CREATE_LOGDIR=1 
fi 

CELERYD_LOG_LEVEL=${CELERYD_LOG_LEVEL:-${CELERYD_LOGLEVEL:-$DEFAULT_LOG_LEVEL}} 
CELERYD_MULTI=${CELERYD_MULTI:-"celeryd-multi"} 
CELERYD=${CELERYD:-$DEFAULT_CELERYD} 
CELERYD_NODES=${CELERYD_NODES:-$DEFAULT_NODES} 

export CELERY_LOADER 

if [ -n "$2" ]; then 
    CELERYD_OPTS="$CELERYD_OPTS $2" 
fi 

CELERYD_LOG_DIR=`dirname $CELERYD_LOG_FILE` 
CELERYD_PID_DIR=`dirname $CELERYD_PID_FILE` 

# Extra start-stop-daemon options, like user/group. 
if [ -n "$CELERYD_USER" ]; then 
    DAEMON_OPTS="$DAEMON_OPTS --uid=$CELERYD_USER" 
fi 
if [ -n "$CELERYD_GROUP" ]; then 
    DAEMON_OPTS="$DAEMON_OPTS --gid=$CELERYD_GROUP" 
fi 

if [ -n "$CELERYD_CHDIR" ]; then 
    DAEMON_OPTS="$DAEMON_OPTS --workdir=$CELERYD_CHDIR" 
fi 


check_dev_null() { 
    if [ ! -c /dev/null ]; then 
     echo "/dev/null is not a character device!" 
     exit 75 # EX_TEMPFAIL 
    fi 
} 


maybe_die() { 
    if [ $? -ne 0 ]; then 
     echo "Exiting: $* (errno $?)" 
     exit 77 # EX_NOPERM 
    fi 
} 

create_default_dir() { 
    if [ ! -d "$1" ]; then 
     echo "- Creating default directory: '$1'" 
     mkdir -p "$1" 
     maybe_die "Couldn't create directory $1" 
     echo "- Changing permissions of '$1' to 02755" 
     chmod 02755 "$1" 
     maybe_die "Couldn't change permissions for $1" 
     if [ -n "$CELERYD_USER" ]; then 
      echo "- Changing owner of '$1' to '$CELERYD_USER'" 
      chown "$CELERYD_USER" "$1" 
      maybe_die "Couldn't change owner of $1" 
     fi 
     if [ -n "$CELERYD_GROUP" ]; then 
      echo "- Changing group of '$1' to '$CELERYD_GROUP'" 
      chgrp "$CELERYD_GROUP" "$1" 
      maybe_die "Couldn't change group of $1" 
     fi 
    fi 
} 


check_paths() { 
    if [ $CELERY_CREATE_LOGDIR -eq 1 ]; then 
     create_default_dir "$CELERYD_LOG_DIR" 
    fi 
    if [ $CELERY_CREATE_RUNDIR -eq 1 ]; then 
     create_default_dir "$CELERYD_PID_DIR" 
    fi 
} 

create_paths() { 
    create_default_dir "$CELERYD_LOG_DIR" 
    create_default_dir "$CELERYD_PID_DIR" 
} 

export PATH="${PATH:+$PATH:}/usr/sbin:/sbin" 


_get_pid_files() { 
    [ ! -d "$CELERYD_PID_DIR" ] && return 
    echo `ls -1 "$CELERYD_PID_DIR"/*.pid 2> /dev/null` 
} 

stop_workers() { 
    $CELERYD_MULTI stopwait $CELERYD_NODES --pidfile="$CELERYD_PID_FILE" 
    sleep $SLEEP_SECONDS 
} 


start_workers() { 
    $CELERYD_MULTI start $CELERYD_NODES $DAEMON_OPTS  \ 
         --pidfile="$CELERYD_PID_FILE"  \ 
         --logfile="$CELERYD_LOG_FILE"  \ 
         --loglevel="$CELERYD_LOG_LEVEL" \ 
         --cmd="$CELERYD"     \ 
         $CELERYD_OPTS 
    sleep $SLEEP_SECONDS 
} 


restart_workers() { 
    $CELERYD_MULTI restart $CELERYD_NODES $DAEMON_OPTS  \ 
          --pidfile="$CELERYD_PID_FILE" \ 
          --logfile="$CELERYD_LOG_FILE" \ 
          --loglevel="$CELERYD_LOG_LEVEL" \ 
          --cmd="$CELERYD"     \ 
          $CELERYD_OPTS 
    sleep $SLEEP_SECONDS 
} 

check_status() { 
    local pid_files= 
    pid_files=`_get_pid_files` 
    [ -z "$pid_files" ] && echo "celeryd not running (no pidfile)" && exit 1 

    local one_failed= 
    for pid_file in $pid_files; do 
     local node=`basename "$pid_file" .pid` 
     local pid=`cat "$pid_file"` 
     local cleaned_pid=`echo "$pid" | sed -e 's/[^0-9]//g'` 
     if [ -z "$pid" ] || [ "$cleaned_pid" != "$pid" ]; then 
      echo "bad pid file ($pid_file)" 
     else 
      local failed= 
      kill -0 $pid 2> /dev/null || failed=true 
      if [ "$failed" ]; then 
       echo "celeryd (node $node) (pid $pid) is stopped, but pid file exists!" 
       one_failed=true 
      else 
       echo "celeryd (node $node) (pid $pid) is running..." 
      fi 
     fi 
    done 

    [ "$one_failed" ] && exit 1 || exit 0 
} 


case "$1" in 
    start) 
     check_dev_null 
     check_paths 
     start_workers 
    ;; 

    stop) 
     check_dev_null 
     check_paths 
     stop_workers 
    ;; 

    reload|force-reload) 
     echo "Use restart" 
    ;; 

    status) 
     check_status 
    ;; 

    restart) 
     check_dev_null 
     check_paths 
     restart_workers 
    ;; 
    try-restart) 
     check_dev_null 
     check_paths 
     restart_workers 
    ;; 
    create-paths) 
     check_dev_null 
     create_paths 
    ;; 
    check-paths) 
     check_dev_null 
     check_paths 
    ;; 
    *) 
     echo "Usage: /etc/init.d/celeryd {start|stop|restart|kill|create-paths}" 
     exit 64 # EX_USAGE 
    ;; 
esac 

exit 0 

而且,我執行初始化腳本使用下面的命令:sh -x /etc/init.d/celeryd start,如the documentation建議,這是輸出:

# sh -x /etc/init.d/celeryd start 
+ SLEEP_SECONDS=5 
+ DEFAULT_PID_FILE=/var/run/celery/%n.pid 
+ DEFAULT_LOG_FILE=/var/log/celery/%n.log 
+ DEFAULT_LOG_LEVEL=INFO 
+ DEFAULT_NODES=celery 
+ DEFAULT_CELERYD=-m celery.bin.celeryd_detach 
+ CELERY_DEFAULTS=/etc/default/celeryd 
+ test -f /etc/default/celeryd 
+ . /etc/default/celeryd 
+ CELERYD_NODES=w1 w2 w3 w4 w5 w6 w7 w8 
+ CELERYD_OPTS=--time-limit=300 --concurrency=8 
+ CELERYD_CHDIR=/srv/www/web-system/myproject 
+ CELERYD_LOG_FILE=/srv/www/web-system/logs/celery/%n.log 
+ CELERYD_PID_FILE=/srv/www/web-system/pids/celery/%n.pid 
+ CELERYD_LOG_LEVEL=INFO 
+ CELERYD_MULTI=/srv/www/web-system/myproject/manage.py celeryd_multi 
+ CELERYCTL=/srv/www/web-system/myproject/manage.py celeryctl 
+ CELERYD_USER=myproject 
+ CELERYD_GROUP=myproject 
+ export DJANGO_SETTINGS_MODULE=myproject.settings 
+ CELERY_CREATE_DIRS=0 
+ CELERY_CREATE_RUNDIR=0 
+ CELERY_CREATE_LOGDIR=0 
+ [ -z /srv/www/sistema-web/pids/celery/%n.pid ] 
+ [ -z /srv/www/sistema-web/logs/celery/%n.log ] 
+ CELERYD_LOG_LEVEL=INFO 
+ CELERYD_MULTI=/srv/www/web-system/myproject/manage.py celeryd_multi 
+ CELERYD=-m celery.bin.celeryd_detach 
+ CELERYD_NODES=w1 w2 w3 w4 w5 w6 w7 w8 
+ export CELERY_LOADER 
+ [ -n ] 
+ dirname /srv/www/web-system/logs/celery/%n.log 
+ CELERYD_LOG_DIR=/srv/www/web-system/logs/celery 
+ dirname /srv/www/web-system/pids/celery/%n.pid 
+ CELERYD_PID_DIR=/srv/www/web-system/pids/celery 
+ [ -n yougrups ] 
+ DAEMON_OPTS= --uid=myprojects 
+ [ -n yougrups ] 
+ DAEMON_OPTS= --uid=myprojects --gid=myprojects 
+ [ -n /srv/www/web-system/myprojects ] 
+ DAEMON_OPTS= --uid=myproject --gid=myproject --workdir=/srv/www/web-system/myproject 
+ export PATH=/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/sbin:/bin:/usr/games:/usr/sbin:/sbin 
+ check_dev_null 
+ [ ! -c /dev/null ] 
+ check_paths 
+ [ 0 -eq 1 ] 
+ [ 0 -eq 1 ] 
+ start_workers 
+ /srv/www/web-system/myproject/manage.py celeryd_multi start w1 w2 w3 w4 w5 w6 w7 w8 --uid=myproject --gid=myproject --workdir=/srv/www/web-system/myproject --pidfile=/srv/www/web-system/pids/celery/%n.pid --logfile=/srv/www/web-system/logs/celery/%n.log --loglevel=INFO --cmd=-m celery.bin.celeryd_detach --time-limit=300 --concurrency=8 
celeryd-multi v3.0.21 (Chiastic Slide) 
> Starting nodes... 
    > w1.myproject: OK 
    > w2.myproject: OK 
    > w3.myproject: OK 
    > w4.myproject: OK 
    > w5.myproject: OK 
    > w6.myproject: OK 
    > w7.myproject: OK 
    > w8.myproject: OK 
+ sleep 5 
+ exit 0 

然後,當我檢查pid目錄時,它是空的,並且ps aux表示沒有關於它的活動進程。日誌中也沒有任何內容。我沒有使用virtualenv。它剛剛停止工作。 django-celery的版本是3.0.21。這裏是我的wsgi腳本:

#!/usr/bin/python 
# -*- coding: utf-8 -*- 
import os 
import sys 

path = '/srv/www/web-system/' 
if path not in sys.path: 
    sys.path.append(path) 
    sys.path.append(path + 'myproject/') 

os.environ['DJANGO_SETTINGS_MODULE'] = 'myproject.settings' 

import djcelery 
djcelery.setup_loader() 

import django.core.handlers.wsgi 
application = django.core.handlers.wsgi.WSGIHandler() 

這是我djcelery相關設置:

# For Celery & RabbitMQ: 
# Create user and vhost for RabbitMQ with the following commands: 
# 
# $ sudo rabbitmqctl add_user myproject <mypassword> 
# $ sudo rabbitmqctl add_vhost myproject 
# $ sudo rabbitmqctl set_permissions -p myproject myproject ".*" ".*" ".*" 
# Format: amqp://user:[email protected]:port/vhost 
BROKER_URL = 'amqp://myproject:[email protected]:5672/myproject' 
import djcelery 
djcelery.setup_loader() 

請,任何建議將非常感激!!!!在此先感謝...

+0

你有沒有解決過這個問題? – wdh

+2

@wdh:是的!我解決了它,但我不記得到底發生了什麼。芹菜執行的代碼存在錯誤(我認爲他們處於我的定期任務中),當他們因某種原因墜毀時,芹菜也崩潰了。我把所有與芹菜有關的代碼hahahaha的標誌(印刷品),直到我發現什麼是崩潰。我想一些例外不會被芹菜抓住,並讓芹菜停止。但是我再說一遍,我不記得到底發生了什麼。我希望這個老的非專業的調試技術可以幫助你:) – Throoze

+1

謝謝,這真的很有用! – wdh

回答

2

您的代碼中可能存在錯誤。嘗試使用

celery worker -A appname 

手動如果它拋出一個錯誤運行它,那麼你知道這是什麼不妥的地方。

+0

檢查問題中的註釋。它已經解決了,正如你所說,它是我的代碼中的一個錯誤。感謝您的貢獻! – Throoze

0

它最有可能與內存做你的系統
Info Logs [2017年8月2日10:00:32004:CRITICAL/MainProcess]上不可恢復的錯誤:OSERROR(12, '無法分配內存') 回溯(最近呼叫最後)

我只是在調試我的感謝@Adriaan

+1

這只是一個猜測......因此更多的是評論而不是答案。 – GhostCat