View Issue Details

IDProjectCategoryView StatusLast Update
0001291bareos-core[All Projects] directorpublic2020-12-18 11:31
ReporterhostedpowerAssigned To 
PriorityhighSeveritymajorReproducibilityrandom
Status newResolutionopen 
PlatformLinuxOSDebianOS Version10
Product Version19.2.9 
Fixed in Version 
Summary0001291: Since upgrade a lot of jobs start hanging
DescriptionHi,

Since recently we see a lot of jobs which seem to start, but they never really start. They hang somewhere in the beginning of the backup job. We suspect it started when we upgraded to latest version.


Steps To ReproduceA sample:

2020-12-17 08:07:34 pim1.hostsample.com JobId 100580: Extended attribute support is enabled <------------- hangs forever here
2020-12-17 08:07:34 pim1.hostsample.com JobId 100580: ACL support is enabled
2020-12-17 08:07:23 mydir-dir JobId 100580: Start Backup JobId 100580, Job=backup-pim1.hostsample.com.2020-12-17_00.15.01_05
2020-12-17 08:07:23 mydir-dir JobId 100580: Connected Storage daemon at backup08.xxxxxx:9103, encryption: TLS_CHACHA20_POLY1305_SHA256
2020-12-17 08:07:23 mydir-dir JobId 100580: Using Device "hostsamplepim1-incr" to write.
2020-12-17 08:07:23 mydir-dir JobId 100580: Connected Client: pim1.hostsample.com at xxx.xxx.xxx.xxx:9102, encryption: TLS_CHACHA20_POLY1305_SHA256
2020-12-17 08:07:23 mydir-dir JobId 100580: Handshake: Immediate TLS
2020-12-17 08:07:23 mydir-dir JobId 100580: Encryption: TLS_CHACHA20_POLY1305_SHA256
2020-12-17 08:07:23 mydir-dir JobId 100580: Sending Accurate information.
2020-12-17 08:07:23 pim1.hostsample.com JobId 100580: Connected Storage daemon at backup08.xxxxxx:9103, encryption: TLS_CHACHA20_POLY1305_SHA256

Additional InformationWe tried waiting, but they just sit there hours and hours :(
TagsNo tags attached.
bareos-master: impact
bareos-master: action
bareos-19.2: impact
bareos-19.2: action
bareos-18.2: impact
bareos-18.2: action
bareos-17.2: impact
bareos-17.2: action
bareos-16.2: impact
bareos-16.2: action
bareos-15.2: impact
bareos-15.2: action
bareos-14.2: impact
bareos-14.2: action
bareos-13.2: impact
bareos-13.2: action
bareos-12.4: impact
bareos-12.4: action

Activities

hostedpower

hostedpower

2020-12-17 09:44

reporter   ~0004071

We checked further, a high percentage of the jobs has these problems at this moment, it's quite severe :(
hostedpower

hostedpower

2020-12-18 11:31

reporter   ~0004073

We went back to bareos 19.2.8 on the director node, but we had the feeling the jobs still got stuck. Afterwards we also downgraded all storage daemons to 19.2.8 and rebooted all backup servers.

Since we did these steps, all seems back to normal.

Any idea something changed in the storage daemon which can cause this behavior?

Issue History

Date Modified Username Field Change
2020-12-17 09:34 hostedpower New Issue
2020-12-17 09:44 hostedpower Note Added: 0004071
2020-12-18 11:31 hostedpower Note Added: 0004073