firebird-support - Re: [firebird-support] Problem with FB database that freezes

Subject	Re: [firebird-support] Problem with FB database that freezes
Author
Post date	2015-07-23T22:23:33Z

Hi Mark,

Thanks for your insightful response. FWIW, I would like to mention that, in the same server, we have another database (same size ~7 GB) no one connects to, it's a restore of the production database from January this year. This database works perfectly even when the production database is down. We try only a few test connections though.

Below is some of the requested information, at a time when the production database performance is normal.

I beg you to please read it until the end. You might have nailed something.

Thanks again. Hope to hear from you soon,

-Eduardo

Firebird.conf:

---------------------

DefaultDbCachePages = 1024

#FileSystemCacheThreshold = 65536 (commented out)

#FileSystemCacheSize = 0 (commented out)

Server environment:

--------------------------

CPU utilization: 11%

Memory utilization: 11 GB (out of 32)

Note.- Even when the DB performance is down, this values are in the same range or even lower. No swapping.

gstat output (normal performance):

---------------------------------------------------------

Database header page information:
Flags   0
Checksum  12345
Generation  19572161
Page size  16384
ODS version  11.2
Oldest transaction 18709808
Oldest active  18953295
Oldest snapshot  18851591
Next transaction 19520857
Bumped transaction 1
Sequence number  0
Next attachment ID 50438
Implementation ID 26
Shadow count  0
Page buffers  3000
Next header page 0
Database dialect 1
Creation date  Jul 7, 2015 7:00:57
Attributes  no reserve

Variable header data:
Database backup GUID: {BF8D26E0-970E-431A-7FAD-E2D9BDB2E4DA}
Sweep interval: 0
*END*

Note.- We seep the database manually each night.

fb_lock_print output (normal performance):

----------------------------------------------------------------

LOCK_HEADER BLOCK
Version: 145, Active owner:      0, Length: 28311552, Used: 27588104
Flags: 0x0001
Enqs: 69364533, Converts: 192066, Rejects: 36029, Blocks: 282250
Deadlock scans:      7, Deadlocks:      0, Scan interval: 10
Acquires: 77720068, Acquire blocks: 2159883, Spin count:   0
Mutex wait: 2.8%
Hash slots: 1009, Hash lengths (min/avg/max):   51/ 66/ 81
Remove node:      0, Insert queue:      0, Insert prior:      0
Owners (145): forward: 441288, backward: 98120
Free owners (11): forward: 24695928, backward: 23070064
Free locks (2963): forward: 22024, backward: 27499760
Free requests (42905): forward: 22145288, backward: 25253392
Lock Ordering: Enabled

Firebird.log (IBMCASA is the server's host name)

------------------------------------------------------------------

The log is literally FULL of 10053 and 10054 error entries like the following:

IBMCASA Thu Jul 23 10:27:27 2015
Unable to complete network request to host "IBMCASA".
Error writing data to the connection.

IBMCASA Thu Jul 23 10:27:29 2015
Unable to complete network request to host "IBMCASA".
Error reading data from the connection.

IBMCASA Thu Jul 23 10:27:30 2015
INET/inet_error: read errno = 10054

According to the log, this errors seems to be happening every second or every few seconds/minutes, since March 8 2014 and until today even as I'm writing this. Each day, this errors stop at 11:49 PM when the last users stop working on the client apps, then they'll start again every morning at 6:00 AM when the first client apps connect to the database.