[Twisted-Python] buildbot.twistedmatrix.com is down a lot
Glyph Lefkowitz
glyph at twistedmatrix.com
Wed Jul 20 15:31:54 MDT 2016
> On Jul 20, 2016, at 11:01 AM, Adi Roiban <adi at roiban.ro> wrote:
>
>
>
> On 20 July 2016 at 17:51, Glyph Lefkowitz <glyph at twistedmatrix.com <mailto:glyph at twistedmatrix.com>> wrote:
>
>> On Jul 20, 2016, at 6:31 AM, Adi Roiban <adi at roiban.ro <mailto:adi at roiban.ro>> wrote:
>>
>>
>>
>> On 18 July 2016 at 19:04, James Broadhead <jamesbroadhead at gmail.com <mailto:jamesbroadhead at gmail.com>> wrote:
>> On 17 July 2016 at 07:21, Amber Brown <hawkowl at atleastfornow.net <mailto:hawkowl at atleastfornow.net>> wrote:
>> It's OOMing (...)
>>
>>
>> Have you considered something like monit[1] to detect & restart in cases like this?
>>
>>
>> This might help, but will not help up understand what we are doing wrong :)
>>
>> After disabling the github webhooks, the buildbot look stable... so we might have a clue about what goes wrong.
>>
>> Right now I don't have time to look into this issue, so github hooks are disabled for now from the GitHub UI.
>
> Can someone who's had a direct look at the OOMing process (adi? amber?) report this upstream? It's a real pity that we won't get github statuses for buildbot builds any more; that was a huge step in the right direction.
>
>
> I don't know how to grasp this.
> By the time I was observing the issue, the buildbot process was already dead.
Yeah, these types of issues are tricky to debug. Thanks for looking into it nonetheless; I was hoping you knew more, but if you don't, nothing to be done.
> I have recently discovered the Rackspace monitoring capabilities for VM... and set up a memory notification... not sure who will receive the alerts.
I'll make sure that the relevant people are on the monitoring list.
> I have re-enable to GitHub hooks and will start taking a closer look at the buildmaster process.... but maybe 2GB is just not enough for a buildmaster.
Thanks.
> I have triggered the creation of an image for the current buildbot machine and will consider upgrading the buildbot to 4GB of memory to see if we still hit the ceiling.
> For my project I have a similar buildmaster based on number of builders and slaves (without github hooks and without linter factories) and in 2 weeks of uptime the virtual memory usage is 1.5GB
> .... so mabybe 2GB is just not enough for buildbot.
Bummer. It does seem like that's quite likely.
-glyph
-------------- next part --------------
An HTML attachment was scrubbed...
URL: </pipermail/twisted-python/attachments/20160720/b0ae6073/attachment-0002.html>
More information about the Twisted-Python
mailing list