14:30:46 #startmeeting Pulp Triage 2016-07-01 14:30:46 #info asmacdo has joined triage 14:30:47 Meeting started Fri Jul 1 14:30:46 2016 UTC and is due to finish in 60 minutes. The chair is asmacdo. Information about MeetBot at http://wiki.debian.org/MeetBot. 14:30:47 Useful Commands: #action #agreed #help #info #idea #link #topic #startvote. 14:30:47 The meeting name has been set to 'pulp_triage_2016_07_01' 14:30:47 asmacdo has joined triage 14:30:48 feel free to ping me in #pulp 14:30:49 @here 14:30:51 !here 14:30:51 #info bmbouter has joined triage 14:30:52 bmbouter has joined triage 14:30:57 !here 14:30:57 #info preethi has joined triage 14:30:58 preethi has joined triage 14:31:34 !here 14:31:34 #info dalley has joined triage 14:31:34 dalley has joined triage 14:31:34 https://pulp.plan.io/issues?query_id=30 14:31:36 Title: Un-Triaged Bugs - Pulp (at pulp.plan.io) 14:31:39 !here 14:31:39 #info dkliban has joined triage 14:31:39 dkliban has joined triage 14:31:45 !next 14:31:46 7 issues left to triage: 2039, 2042, 2045, 2047, 2049, 2050, 2051 14:31:46 #topic Not full importer config is validated in the update call - http://pulp.plan.io/issues/2039 14:31:47 Pulp Issue #2039 [POST] (ipanova@redhat.com) - Priority: Normal | Severity: Medium 14:31:47 Not full importer config is validated in the update call - http://pulp.plan.io/issues/2039 14:31:55 !here 14:31:55 #info pcreech has joined triage 14:31:56 pcreech has joined triage 14:32:10 already in post 14:32:14 !propose accept 14:32:14 #idea Proposed for #2039: Leave the issue as-is, accepting its current state. 14:32:14 Proposed for #2039: Leave the issue as-is, accepting its current state. 14:32:26 +1 14:32:34 it is in post but Ina told me that she want the team to be aware of this bug 14:32:45 ah ok 14:32:45 I do not know much more about it 14:32:58 !here 14:32:58 #info mhrivnak has joined triage 14:32:58 mhrivnak has joined triage 14:33:02 but that is the reason why it is not marked as triaged 14:33:28 +1 accept 14:33:34 anyone else? 14:33:55 !accept 14:33:55 #agreed Leave the issue as-is, accepting its current state. 14:33:56 Current proposal accepted: Leave the issue as-is, accepting its current state. 14:33:57 #topic updateinfo.xml is missing checksums - http://pulp.plan.io/issues/2042 14:33:57 6 issues left to triage: 2042, 2045, 2047, 2049, 2050, 2051 14:33:57 RPM Support Issue #2042 [NEW] (unassigned) - Priority: Normal | Severity: Medium 14:33:58 updateinfo.xml is missing checksums - http://pulp.plan.io/issues/2042 14:34:43 ttereshc, so am I reading this correctly that the default behavior changed with 2.9 regarding how checksums are included? 14:35:20 before 2.9 pulp did not handle multiple checksums atll 14:36:08 oh, that's another issue, sorry 14:37:03 yes we publish the checksum of the distributor checksum_type for the packages in the erratum 14:37:13 I think what ichimonji10 is pointing out with this bug is that if you just sync and publish a repo with default settings, in 2.8 you see checksums in errata, and in 2.9 you don't. 14:37:14 !here 14:37:14 #info jcline has joined triage 14:37:15 jcline has joined triage 14:37:30 I assume you don't because the checksums don't match the checksum used for the rest of the repo metadata? 14:38:35 If that's true, perhaps we should relax that and allow the previous behavior to continue. So by default it will publish whatever checksum value is available, and will only change behavior if the new setting is used. 14:39:00 Sorry to get into the weeds on details, but I'm trying to confirm whether we think this is a bug. :) 14:39:13 mhrivnak: That's exactly it. 14:39:21 im afraid i am not familiar enough with this one to add to the conversation, perhaps lets skip this and discuss on comments 14:39:24 if there are multiple checksums and none is matched the repo checksum, what to do? 14:39:48 in 2.8 it would be an error 14:40:02 ttereshc: The repo distributor has *no* checksum configured 14:40:22 Hm, ok. I'm with asmacdo that we can continue this discussion post-triage and move on for now. ttereshc ichimonji10 does that work for you? 14:40:23 ichimonji10, then it is sha256, iirc 14:40:43 mhrivnak, sure 14:40:48 ttereshc: Let's catch up on this via video chat later 14:40:50 !propose skip 14:40:50 #idea Proposed for #2042: Skip this issue for this triage session. 14:40:50 Proposed for #2042: Skip this issue for this triage session. 14:40:51 It's an interesting one 14:41:07 ichimonji10, mhrivnak ttereshc lets try to do that soon 14:41:19 if this is a bug, it sounds like it might need to block a release? 14:41:23 I'm free the rest of today 14:41:31 *most of today 14:41:33 !accept 14:41:33 #agreed Skip this issue for this triage session. 14:41:33 Current proposal accepted: Skip this issue for this triage session. 14:41:35 #topic Task stuck at waiting if child process segfaults - http://pulp.plan.io/issues/2045 14:41:35 5 issues left to triage: 2045, 2047, 2049, 2050, 2051 14:41:35 Pulp Issue #2045 [NEW] (unassigned) - Priority: Normal | Severity: High 14:41:36 Task stuck at waiting if child process segfaults - http://pulp.plan.io/issues/2045 14:41:48 jortel and I discovered this together 14:42:02 oof. 14:42:06 I wanted to triage it onto the sprint acually 14:42:17 or triage it not on the sprint but to high/high 14:42:22 This sounds similar to a bug rbarlow filed about tasks that got OOM-killed also waited forever 14:42:36 jcline: oh yes I do remember something about that 14:42:47 let's triage this one and I can search to deduplicate 14:42:49 !propose triage high high 14:42:49 #idea Proposed for #2045: Priority: High, Severity: High 14:42:50 Proposed for #2045: Priority: High, Severity: High 14:42:52 https://pulp.plan.io/issues/1673 14:42:54 Title: Issue #1673: Pulp's worker watcher does not notice workers that got killed by OOM killer and their tasks stay "running" forever - Pulp (at pulp.plan.io) 14:43:03 ah nice 14:43:06 I wonder if they can both be fixed by one thing? 14:43:24 probably. 14:43:27 they are almost certainly the same 14:43:32 any thoughts re: put this on the sprint? 14:43:40 the key detail that we know now is that it's specifically the child process 14:43:52 the parent celery process does not have this problem 14:44:10 oh there is some discussion of that on 1673 also actually 14:44:27 ok so we can close the new one as a dup 14:44:30 Yep. It's a painful user experience when this happens, so it's a big opportunity for us to improve the experience. 14:44:46 it will be easy I think, I outlined 3 solutions on 2045 14:44:53 bmbouter, perhaps let's discuss adding 1673 to a sprint separately? 14:45:01 ichimonji10, mhrivnak, asmacdo, I can talk after triage 14:45:10 thanks ttereshc 14:45:10 sounds good, let's close 2045 as a dup then 14:45:23 bmbouter, celery doesn't have watch dog for tasks? 14:45:30 !propose other close as suplicate of #1673 14:45:30 #idea Proposed for #2045: close as suplicate of #1673 14:45:30 Proposed for #2045: close as suplicate of #1673 14:45:33 !propose other close as duplicate of #1673 14:45:33 #idea Proposed for #2045: close as duplicate of #1673 14:45:34 Proposed for #2045: close as duplicate of #1673 14:45:36 it does the defect is in the pulp bookeeping 14:46:00 bmbouter, ah 14:46:02 asmacdo: I'm going to go ahead and close it now as a dup 14:46:14 bmbouter++ 14:46:14 bmbouter's karma is now 22 14:46:53 !accept 14:46:53 #agreed close as duplicate of #1673 14:46:54 Current proposal accepted: close as duplicate of #1673 14:46:55 #topic directory creation race condition during publish - http://pulp.plan.io/issues/2047 14:46:55 4 issues left to triage: 2047, 2049, 2050, 2051 14:46:55 Pulp Issue #2047 [NEW] (unassigned) - Priority: Normal | Severity: Medium 14:46:56 directory creation race condition during publish - http://pulp.plan.io/issues/2047 14:47:08 * mhrivnak looks for a katello person 14:47:14 partha ! 14:47:35 maybe too early :) 14:47:50 Sat QE hit this problem. 14:48:16 I'm a bit surprised, but we've seen race conditions like this before that seemed unlikely, but they find a way to run into them. 14:48:43 yeah I read this issue and it made sense to me 14:48:46 as did the recommended solution 14:48:55 more pythonic anyway 14:49:03 !propose accept 14:49:03 #idea Proposed for #2047: Leave the issue as-is, accepting its current state. 14:49:04 Proposed for #2047: Leave the issue as-is, accepting its current state. 14:49:08 +1 14:49:54 !accept 14:49:54 #agreed Leave the issue as-is, accepting its current state. 14:49:54 Current proposal accepted: Leave the issue as-is, accepting its current state. 14:49:56 #topic Django RemovedInDjango110Warning in logs for missing TEMPLATES setting - http://pulp.plan.io/issues/2049 14:49:56 3 issues left to triage: 2049, 2050, 2051 14:49:56 Pulp Issue #2049 [NEW] (unassigned) - Priority: High | Severity: Medium 14:49:57 Django RemovedInDjango110Warning in logs for missing TEMPLATES setting - http://pulp.plan.io/issues/2049 14:50:50 bmbouter, is this a dupe? 14:50:53 No 14:51:00 Well 14:51:07 Do we use django's template loading feature? 14:51:13 There is a second one for a different deprecation 14:51:16 Yes we do 14:51:30 just curious: do know where or what for? 14:52:27 Rendering human-readable HTTP 404 and HTTP 500s, specifically in the content wsgi app 14:52:38 oh that could be 14:52:42 because I don't see it here: https://github.com/pulp/pulp/blob/master/server/pulp/server/webservices/settings.py#L21-L26 14:52:43 Title: pulp/settings.py at master · pulp/pulp · GitHub (at github.com) 14:52:45 gotcha. 14:52:46 but that is different WSGI ap 14:52:47 app 14:53:20 FWIW I actually made a pass at adding this to settings.py recently, but abandoned it because compatibility with django 1.4 was challenging. 14:53:37 Well we're about to be in a pickle, then 14:53:41 yes we are 14:53:43 Maybe not impossible, but it was at least annoying, and I determined I could continue without it. 14:53:48 * jcline chants SCL 14:54:05 * mhrivnak chants EL7 ;) 14:54:11 mhrivnak: oh yeah I remember that 14:54:31 !propose accept 14:54:31 #idea Proposed for #2049: Leave the issue as-is, accepting its current state. 14:54:31 Proposed for #2049: Leave the issue as-is, accepting its current state. 14:54:37 We can always do a version check, it shouldn't be difficult 14:54:45 one way or another, we need to get to the bottom of this. 14:55:11 any opposed to accept? 14:55:15 jcline: I was thinking verion check too 14:55:16 +1 to accept 14:55:20 good file jcline 14:55:32 !accept 14:55:32 Current proposal accepted: Leave the issue as-is, accepting its current state. 14:55:32 #agreed Leave the issue as-is, accepting its current state. 14:55:34 2 issues left to triage: 2050, 2051 14:55:35 #topic HTTP 500 rather than a HTTP 404 when a missing resource is requested - http://pulp.plan.io/issues/2050 14:55:35 Pulp Issue #2050 [NEW] (unassigned) - Priority: High | Severity: Low 14:55:35 HTTP 500 rather than a HTTP 404 when a missing resource is requested - http://pulp.plan.io/issues/2050 14:56:12 this issue is written generally, but it's specific to that 1 URL 14:56:29 oh no wait it is for any URL that doesn't match 14:56:30 bmbouter, are you certain of that? 14:56:59 !propose accept 14:56:59 #idea Proposed for #2050: Leave the issue as-is, accepting its current state. 14:56:59 Proposed for #2050: Leave the issue as-is, accepting its current state. 14:57:09 definitely bad. 14:57:59 jcline: yeah it stays in all django code so this is a general problem 14:58:20 other thoughts? 14:58:43 I'm fine with accepting it as-is, naturally 14:58:56 !accept 14:58:56 #agreed Leave the issue as-is, accepting its current state. 14:58:56 Current proposal accepted: Leave the issue as-is, accepting its current state. 14:58:57 1 issues left to triage: 2051 14:58:58 #topic Ludicrous number of WARNING logs from publish - http://pulp.plan.io/issues/2051 14:58:58 RPM Support Issue #2051 [NEW] (unassigned) - Priority: High | Severity: High 14:58:58 Ludicrous number of WARNING logs from publish - http://pulp.plan.io/issues/2051 14:59:43 are the warnings about something specific? 15:00:30 mhrivnak, see the log snippet in the report. That's all there is. 15:00:31 it looks like NEVRA 15:00:56 Yep, it's just hard to read and appears truncated. 15:01:21 In any case, it's highly unfortunate and we should fix it. :) 15:01:23 it's likely one line for all the statements in a loop 15:01:24 I didn't think anyone wanted to read 11k NEVRAs 15:01:41 do we want to propose just lowering to DEBUG? 15:01:44 and adding easyfix? 15:01:52 both of those sound good 15:01:55 The logs have a max line length and will get truncated 15:02:21 !propose triage high low 15:02:21 #idea Proposed for #2051: Priority: High, Severity: Low 15:02:21 Proposed for #2051: Priority: High, Severity: Low 15:02:29 let's not propose DEBUG change and just triage with adding easy fix 15:02:49 bmbouter, i meant to add DEBUG as a comment 15:03:01 Oh ya, it's just one line. 15:03:04 easy fix. 15:03:22 thats ok though, im sure whoever picks this up will evaluate what to do 15:03:46 i changed severity from high to low, is everyon ok with that? 15:03:48 jcline, ? 15:03:50 +1 15:04:09 it does make the logs unusable 15:04:12 I don't really mind one way or the other, but it's very ruinous to the logs 15:04:37 high/low sounds ok since the prio is high 15:04:51 k 15:05:00 #agreed Priority: High, Severity: Low 15:05:00 !accept 15:05:00 Current proposal accepted: Priority: High, Severity: Low 15:05:01 No issues to triage. 15:05:07 yay 15:05:18 !end 15:05:18 #endmeeting