Cross Reference: /external/autotest/scheduler/monitor_db

History log of /external/autotest/scheduler/monitor_db_cleanup.py
Revision	Date	Author	Comments (<<< Hide modified files) (Show modified files >>>)
0f261debdbba7664ef7b0eacbac0e7daba89eebd	02-Feb-2017	Allen Li <ayatane@chromium.org>	[autotest] [atomic] Remove atomic groups from scheduler BUG=chromium:681906 TEST=Run unittest suite Change-Id: If9c144aae8d2a8df567a5a03b02bc3fec5d14c0d Reviewed-on: https://chromium-review.googlesource.com/435565 Commit-Ready: Allen Li <ayatane@chromium.org> Tested-by: Allen Li <ayatane@chromium.org> Reviewed-by: Richard Barnette <jrbarnette@google.com> /external/autotest/scheduler/monitor_db_cleanup.py
5e2efb71ffebead22aa4f0744ad843ee79814b43	07-Feb-2017	Dan Shi <dshi@google.com>	[autotest] Use the metrics_mock object in case chromite is not set up. BUG=chromium:688166 TEST=unittest Change-Id: Ic0077cb2dba75a8d820f229060f3f70f507850a1 Reviewed-on: https://chromium-review.googlesource.com/438754 Commit-Ready: Dan Shi <dshi@google.com> Tested-by: Dan Shi <dshi@google.com> Reviewed-by: Dan Shi <dshi@google.com> /external/autotest/scheduler/monitor_db_cleanup.py
c29b4c7ec10db41f38e0361febe9846a95629b5a	15-Dec-2016	Aviv Keshet <akeshet@chromium.org>	autotest: delete some email alerts; replace some with monarch metrics For email alerts that seem (based on searching my email) to never be sent, I simply deleted them. For those that are sent sometimes and seem easily amenable to a monarch metric instead, I changed them to a metric. This is a first step; there are still many remaining unneccesary email alerts. BUG=chromium:672726 TEST=None Change-Id: Ib1d3715e618623faa16f3faaceabf4218dbad49a Reviewed-on: https://chromium-review.googlesource.com/420468 Commit-Ready: Aviv Keshet <akeshet@chromium.org> Tested-by: Aviv Keshet <akeshet@chromium.org> Reviewed-by: Aviv Keshet <akeshet@chromium.org> /external/autotest/scheduler/monitor_db_cleanup.py
598671c5efb6cacd564a71e0c92bcc897d10afc1	23-Nov-2016	Shuqian Zhao <shuqianz@chromium.org>	[autotest] remove autotest_stats from monitor_db_cleanup BUG=chromium:667171 TEST=unittest Change-Id: I962df6f564f6608645a4813d04200a682aa1fe00 Reviewed-on: https://chromium-review.googlesource.com/413633 Commit-Ready: Dan Shi <dshi@google.com> Tested-by: Shuqian Zhao <shuqianz@chromium.org> Reviewed-by: Prathmesh Prabhu <pprabhu@chromium.org> /external/autotest/scheduler/monitor_db_cleanup.py
f2c9ac9f514b2c56799974c98c896ae03119c34d	17-Sep-2016	Aviv Keshet <akeshet@chromium.org>	autotest: add stats around db cleanup time Note, this change requires 386284 to be pushed to prod before it can land. CQ-DEPEND=CL:386284 BUG=chromium:626423 TEST=None Change-Id: Ie78b731a2797574c89f3a0f855ed2f80768395a2 Reviewed-on: https://chromium-review.googlesource.com/386333 Commit-Ready: Aviv Keshet <akeshet@chromium.org> Tested-by: Aviv Keshet <akeshet@chromium.org> Reviewed-by: Aviv Keshet <akeshet@chromium.org> /external/autotest/scheduler/monitor_db_cleanup.py
0a1bb171d722230a19a067d823679d55068f9c2a	18-Sep-2016	Dan Shi <dshi@google.com>	[autotest] Optimize query to abort timed out tests in monitor_db_cleanup Replace join with subquery to fasten the query. BUG=chromium:641290 TEST=local run, unittest Change-Id: Ic4bc55bc6122dfb1d70b9b06c96e3dc4629ba5e8 Reviewed-on: https://chromium-review.googlesource.com/386791 Tested-by: Dan Shi <dshi@google.com> Reviewed-by: Aviv Keshet <akeshet@chromium.org> /external/autotest/scheduler/monitor_db_cleanup.py
3762c6b48420043b48c33b952a918eba44af152d	17-Sep-2016	Dan Shi <dshi@google.com>	[autotest] Optimize query in monitor_db_cleanup BUG=chromium:641290 TEST=local run Change-Id: Id46be265d4c4d1bd5ef2b44ea41d6e7b17202de9 Reviewed-on: https://chromium-review.googlesource.com/386361 Commit-Ready: Dan Shi <dshi@google.com> Tested-by: Dan Shi <dshi@google.com> Reviewed-by: Zelidrag Hornung <zelidrag@chromium.org> Reviewed-by: Aviv Keshet <akeshet@chromium.org> /external/autotest/scheduler/monitor_db_cleanup.py
55d5899addf09ad0fa4a6ea7260e7c7b143b430b	05-May-2015	Dan Shi <dshi@chromium.org>	[autotest] Queue calls in drone after drone refresh. Drone refresh is done in a non-thread safe fashion. It starts the refresh at the beginning of the tick, then follow by couple other operations, then wait for the refresh to finish. When it starts, it executes all queued calls in drone using drone_utils. After drone_utils finishes processing the calls, the scheduler will empty the queued calls in drones. That means any calls added between the drone refresh is started and the completion of drone refresh will be removed without being called. This CL moves the cleanup call after the drone refresh, also add a comment about potential future issues. A better fix might fix the root cause. For example, add a tracker in each drone's call queue. After drone refresh is done, only clear the calls being processed within refresh. crbug.com/484715 is filed to track this issue. BUG=chromium:484039 TEST=local scheduler run, make sure lxc_cleanup is kicked off and finished. Change-Id: I1bb3229a3da578299949a00af25b3d4674eeed4b Reviewed-on: https://chromium-review.googlesource.com/269255 Trybot-Ready: Dan Shi <dshi@chromium.org> Tested-by: Dan Shi <dshi@chromium.org> Reviewed-by: Richard Barnette <jrbarnette@chromium.org> Reviewed-by: Simran Basi <sbasi@chromium.org> Commit-Queue: Dan Shi <dshi@chromium.org> /external/autotest/scheduler/monitor_db_cleanup.py
4ecb004eb95b59977354337d4c8d13ba16b182f9	04-May-2015	Dan Shi <dshi@chromium.org>	[autotest] Disable lxc_cleanup task to avoid scheduler crash. BUG=chromium:484039 TEST=None Change-Id: Ia1c03275792b1c1681c6cbc5e8a202c6c3441b69 Reviewed-on: https://chromium-review.googlesource.com/268991 Reviewed-by: Dan Shi <dshi@chromium.org> Commit-Queue: Dan Shi <dshi@chromium.org> Trybot-Ready: Dan Shi <dshi@chromium.org> Tested-by: Dan Shi <dshi@chromium.org> /external/autotest/scheduler/monitor_db_cleanup.py
c458f66959fde1d934abfff92d20b2dbf115b9c2	29-Apr-2015	Dan Shi <dshi@chromium.org>	[autotest] Add lxc_cleanup to TwentyFourHourUpkeep of the scheduler. So scheduler can kick off lxc_cleanup script in each drone every 24 hours. cron job requires puppet change and it's not supported in moblab. BUG=chromium:479383 TEST=start scheduler locally, check logs/lxc_cleanup.log to confirm the script finished running. Change-Id: I83ebfd6b0888b6f3b2c58d1f3824a692660bd4f7 Reviewed-on: https://chromium-review.googlesource.com/268318 Reviewed-by: Dan Shi <dshi@chromium.org> Commit-Queue: Dan Shi <dshi@chromium.org> Trybot-Ready: Dan Shi <dshi@chromium.org> Tested-by: Dan Shi <dshi@chromium.org> /external/autotest/scheduler/monitor_db_cleanup.py
b715ea477b64ac4814fea81f7988ee4ef353e9a0	24-Feb-2015	Fang Deng <fdeng@chromium.org>	Revert "[autotest] Teach the scheduler to recover stuck hosts" This reverts commit a6c0d253e16cf6ddf94a75ef56e50e376b09411c. Change-Id: Id430e6d563345d39f55d2335da3ee5f2918da205 Reviewed-on: https://chromium-review.googlesource.com/253020 Reviewed-by: Fang Deng <fdeng@chromium.org> Commit-Queue: Fang Deng <fdeng@chromium.org> Tested-by: Fang Deng <fdeng@chromium.org> /external/autotest/scheduler/monitor_db_cleanup.py
a6c0d253e16cf6ddf94a75ef56e50e376b09411c	12-Feb-2015	Fang Deng <fdeng@chromium.org>	[autotest] Teach the scheduler to recover stuck hosts If we have hosts stuck in a state without a task or hqe the scheduler should recover the host. The correctness of this CL relies on the fact that - the scheduler is single threaded and - in the epilog of a task agent, deactivation of the special task/hqe, if occurs, must always occur together with updating host status to the right status within the same tick, otherwise we will have race condition on updating the host status. BUG=chromium:432549 TEST=add unittest. run scheduler, manually corrupt the database, and observer the host is recovered. DEPLOY=scheduler Change-Id: Iab39fd5301e14cfa9623ae09ab80589dac9c7564 Reviewed-on: https://chromium-review.googlesource.com/249010 Tested-by: Fang Deng <fdeng@chromium.org> Reviewed-by: Mungyung Ryu <mkryu@google.com> Commit-Queue: Fang Deng <fdeng@chromium.org> /external/autotest/scheduler/monitor_db_cleanup.py
1e1c41b1b4a1b97c0b7086b8430856ed45e064d3	05-Feb-2015	Gabe Black <gabeblack@chromium.org>	graphite: Separate out configuration from the statsd classes. The new version of the statsd classes should be created using an instance of the new Statsd class which sets up some defaults without having to specify them over and over. This makes it essentially compatible with the existing usage in autotest, but will allow chromite to configure things differently and avoid having side effects from importing the module or global state. BUG=chromium:446291 TEST=Ran unit tests, ran stats_es_functionaltest.py, ran the stats_mock_unittest, ran a butterfly-paladin tryjob with --hwtest, testing by fdeng. DEPLOY=apache,scheduler,host-scheduler Change-Id: I1071813db197c0e5e035b4d8db615030386f1c1c Reviewed-on: https://chromium-review.googlesource.com/246428 Reviewed-by: Fang Deng <fdeng@chromium.org> Reviewed-by: Dan Shi <dshi@chromium.org> Commit-Queue: Gabe Black <gabeblack@chromium.org> Tested-by: Gabe Black <gabeblack@chromium.org> /external/autotest/scheduler/monitor_db_cleanup.py
ac189f3c6cafa7d445162b5ec54e4162d0e679b2	23-Jun-2014	Alex Miller <milleral@chromium.org>	[autotest] Remove indirection in scheduler config. We shouldn't encourage things to be named two different things in two different places. BUG=None DEPLOY=scheduler TEST=ran scheduler Change-Id: I0cfac73f7c2dbc0130f0399d96feda257915cd34 Reviewed-on: https://chromium-review.googlesource.com/205720 Reviewed-by: Prashanth B <beeps@chromium.org> Reviewed-by: Fang Deng <fdeng@chromium.org> Tested-by: Alex Miller <milleral@chromium.org> Reviewed-by: Alex Miller <milleral@chromium.org> Commit-Queue: Alex Miller <milleral@chromium.org> /external/autotest/scheduler/monitor_db_cleanup.py
da8c60af1e1e3ee97170c700d0b72991687e35a2	03-Jun-2014	Michael Liang <michaelliang@chromium.org>	[autotest] Migrate graphite directory to client/common_lib/cros This change allows us to report stats in client tests. 1. Change import paths for all files that import modules from graphite 2. Clean up some unused modules Related CL: https://chromium-review.googlesource.com/#/c/202467/ BUG=chromium:237255 TEST=Ran scheduler locally, scheduled reboot jobs, verified stats such as monitor_db_cleanup.user_cleanup._cleanup were reported on chromeos-stats. DEPLOY = apache, scheduler, host_scheduler Change-Id: Iebfe3b8acc1c363a0b70ea555744e85d1367cb67 Reviewed-on: https://chromium-review.googlesource.com/202727 Reviewed-by: Dan Shi <dshi@chromium.org> Commit-Queue: Michael Liang <michaelliang@chromium.org> Tested-by: Michael Liang <michaelliang@chromium.org> /external/autotest/scheduler/monitor_db_cleanup.py
742b81d29617aab913cf512dbeeccd8606049a57	30-May-2014	Simran Basi <sbasi@chromium.org>	[autotest] move django.sessions cleanup from 24 hours to 5 mins. Currently on Moblab our database's ibdata1 file is hitting its max size causing all sorts of database problems. We suspect this is due to the lack of cleanup of the django.sessions table and are now going to run this as part of the 5 min cleanup vs the 24 hour cleanup. BUG=chromium:379199 TEST=Schedule is running on fresh moblab image. Change-Id: Ibd5261591b5edebd7cafb46a311e2513c1ca713e Reviewed-on: https://chromium-review.googlesource.com/202317 Reviewed-by: Alex Miller <milleral@chromium.org> Tested-by: Simran Basi <sbasi@chromium.org> /external/autotest/scheduler/monitor_db_cleanup.py
4ec9867f46deb969c154bebf2e64729d56c3a1d3	15-May-2014	Prashanth B <beeps@google.com>	[autotest] Split host acquisition and job scheduling II. This cl creates a stand-alone service capable of acquiring hosts for new jobs. The host scheduler will be responsible for assigning a host to a job and scheduling its first special tasks (to reset and provision the host). There on after, the special tasks will either change the state of a host or schedule more tasks against it (eg: repair), till the host is ready to run the job associated with the Host Queue Entry to which it was assigned. The job scheduler (monitor_db) will only run jobs, including the special tasks created by the host scheduler. Note that the host scheduler won't go live till we flip the inline_host_acquisition flag in the shadow config, and restart both services. The host scheduler is dead, long live the host scheduler. TEST=Ran the schedulers, created suites. Unittests. BUG=chromium:344613, chromium:366141, chromium:343945, chromium:343937 CQ-DEPEND=CL:199383 DEPLOY=scheduler, host-scheduler Change-Id: I59a1e0f0d59f369e00750abec627b772e0419e06 Reviewed-on: https://chromium-review.googlesource.com/200029 Reviewed-by: Prashanth B <beeps@chromium.org> Tested-by: Prashanth B <beeps@chromium.org> Commit-Queue: Prashanth B <beeps@chromium.org> /external/autotest/scheduler/monitor_db_cleanup.py
04be2bd5e4666a5c253e9c30ab20555e04286032	08-May-2014	Ilja H. Friedel <ihf@chromium.org>	Autotest: Change logging.warn() to logging.warning(). logging.warn() is deprecated. See http://bugs.python.org/issue13235 Substitution was performed via ~/cros/src/third_party/autotest/files$ find ./ -type f \| xargs sed -i 's/logging.warn(/logging.warning(/' BUG=None. TEST=There should be one-- and preferably only one --obvious way to do it. Change-Id: Ie5665743121a49f7fbd5d1f47896a7c65e87e489 Reviewed-on: https://chromium-review.googlesource.com/198793 Commit-Queue: Ilja Friedel <ihf@chromium.org> Tested-by: Ilja Friedel <ihf@chromium.org> Reviewed-by: Alex Miller <milleral@chromium.org> /external/autotest/scheduler/monitor_db_cleanup.py
372613d54bdfd1b708a5d41ced9a80e209e6dc6a	05-May-2014	Prashanth B <beeps@google.com>	[autotest] Sanity check host assignments. Check that we haven't violated any correctness constraints by assigning the same host to 2 simultaneously active jobs. These changes are in preperation for eventually breaking host assignment out of the scheduler. The performance degradation should be negligable since we're only querying for the host_ids of currently active jobs, every 5 minutes. TEST=Ran suites, unittests. BUG=None DEPLOY=Scheduler Change-Id: Ie560a67861f9e4d1d59cda9828fb9d2ef433e5f4 Reviewed-on: https://chromium-review.googlesource.com/198196 Reviewed-by: Prashanth B <beeps@chromium.org> Tested-by: Prashanth B <beeps@chromium.org> Commit-Queue: Prashanth B <beeps@chromium.org> /external/autotest/scheduler/monitor_db_cleanup.py
7e60574f3baa0b3928623216a74a770f19d2c316	12-Nov-2013	Simran Basi <sbasi@chromium.org>	Autotest: Update RPC, Scheduler and AFE to use timeout_mins. Now that timeout_mins is in the database, we need to update the infrastructure to display and utilize timeout_mins rather than timeout. The scheduler now aborts based off of timeout_mins. The afe displays/creates jobs using timeout_mins rather than timeout. The RPC's take in timeout and timeout_min depending on what is inputted to the system. Note the value supplied for timeout_mins takes precedence over timeout. BUG=chromium:221264 TEST=Created job through AFE, run_suite with smoke suite, job_unittest and rpc_unittests. DEPLOY=apache, afe, scheduler CQ-DEPEND=I09f8b6023a31dcef37e98c298dc0d63b37bc0a9e Change-Id: I838cd1ff85a2ce6bd49fe96bf18fd82be608c758 Reviewed-on: https://chromium-review.googlesource.com/176605 Reviewed-by: Simran Basi <sbasi@chromium.org> Commit-Queue: Simran Basi <sbasi@chromium.org> Tested-by: Simran Basi <sbasi@chromium.org> /external/autotest/scheduler/monitor_db_cleanup.py
ee632915c6709ae493ae485f3f4d063a7e7497f7	09-Oct-2013	Alex Miller <milleral@chromium.org>	[autotest] Add timers to scheduler cleanup. The peaks of scheduler cleanup are also starting to get high enough that I'd like to take a look at trying to lessen them a bit. I'm guessing there's some indexing work to be done here, but I don't know which queries to target. BUG=chromium:305408 DEPLOY=scheduler TEST=Ran scheduler Change-Id: Ibc1924a90c2502b38e6c7e1058f675f497877369 Reviewed-on: https://chromium-review.googlesource.com/172320 Tested-by: Alex Miller <milleral@chromium.org> Reviewed-by: Dan Shi <dshi@chromium.org> Commit-Queue: Alex Miller <milleral@chromium.org> /external/autotest/scheduler/monitor_db_cleanup.py
34217022229b755bc1ee52f83665acba76bd5044	06-Nov-2012	Simran Basi <sbasi@chromium.org>	Autotest: Increase run timeout granularity This CL modifies all references to max_runtime_hrs to instead be max_runtime_mins. This includes the django models, rpc interfaces, the cleanup timeout code, and the frontend java views. The frontend java code will need to be recompiled once this commits to prevent the frontend from breaking. The cleanup timeout pathway has been adjusted to find all timedout jobs by minute, and has been changed to run every 5 minutes vs every hour as before. BUG=chromium-os:36067 TEST=Ran on my local afe, ensure that jobs can still be created correctly, and jobs with short timeouts do indeed get aborted when expected. Change-Id: Idfdeb3f1d4947d6b2e6b48127a31db535704e972 Reviewed-on: https://gerrit.chromium.org/gerrit/37827 Tested-by: Simran Basi <sbasi@chromium.org> Reviewed-by: Scott Zawalski <scottz@chromium.org> Reviewed-by: Simran Basi <sbasi@chromium.org> Commit-Ready: Simran Basi <sbasi@chromium.org> /external/autotest/scheduler/monitor_db_cleanup.py
a8a0d75c2591ab872f695d7b8f22f2ca10493dab	08-Nov-2012	Scott Zawalski <scottz@chromium.org>	Revert "Autotest: Increase run timeout granularity" This causes autoupdate problems, see bug crosbug.com/36183 This reverts commit daffa57a6738c2a80caee1f311a527313ece1207 Change-Id: Iad65be9bf1b0ea329866bfdc764b0089c4f9b2a5 Reviewed-on: https://gerrit.chromium.org/gerrit/37657 Commit-Ready: Scott Zawalski <scottz@chromium.org> Reviewed-by: Scott Zawalski <scottz@chromium.org> Tested-by: Scott Zawalski <scottz@chromium.org> /external/autotest/scheduler/monitor_db_cleanup.py
daffa57a6738c2a80caee1f311a527313ece1207	06-Nov-2012	Simran Basi <sbasi@chromium.org>	Autotest: Increase run timeout granularity This CL modifies all references to max_runtime_hrs to instead be max_runtime_mins. This includes the django models, rpc interfaces, the cleanup timeout code, and the frontend java views. The frontend java code will need to be recompiled once this commits to prevent the frontend from breaking. The cleanup timeout pathway has been adjusted to find all timedout jobs by minute, and has been changed to run every 5 minutes vs every hour as before. BUG=chromium-os:36067 TEST=Ran on my local afe, ensure that jobs can still be created correctly, and jobs with short timeouts do indeed get aborted when expected. Change-Id: Id7668bbd05a9b02c22e7c549fac232fae02fc728 Reviewed-on: https://gerrit.chromium.org/gerrit/37479 Reviewed-by: Scott Zawalski <scottz@chromium.org> Commit-Ready: Simran Basi <sbasi@chromium.org> Tested-by: Simran Basi <sbasi@chromium.org> /external/autotest/scheduler/monitor_db_cleanup.py
1c5b0572f6cbc3b210ef0c5a92c73ed681abe16d	11-Oct-2012	Simran Basi <sbasi@chromium.org>	Autotest: Have the scheduler fix active and complete queue entries. Currently as part of the scheduler's 24 hour cleanup, it looks for entries that are marked as both active=True and complete=True and if it finds any such entries, it sends out an email that will repeat daily till the database is fixed. In order to address this and have the code self correct the problem, the 24 hour cleanup will now mark the entries as not active as they are completed. BUG=chromium-os:35220 TEST=Tested on my local scheduler where I manually set active=Complete=1 and ensured the error occured and when the fix was put in, it self corrected. Change-Id: I954fb4f93753ae843a0cd3169e96d03d3bc63ba3 Reviewed-on: https://gerrit.chromium.org/gerrit/35292 Commit-Ready: Simran Basi <sbasi@chromium.org> Reviewed-by: Simran Basi <sbasi@chromium.org> Tested-by: Simran Basi <sbasi@chromium.org> /external/autotest/scheduler/monitor_db_cleanup.py
5649cff32e9d9cfb5f686b6b4fa08a81f640b553	13-Aug-2012	Scott Zawalski <scottz@google.com>	Trunce django_session table instead of cleanning it up. The table django_session is used to keep track of connections and reuse sessions that have been created. As it currently stands this is not really respected/used in the Autotest framework and this table just ends up growing to ridiculous sizes with no benefit. Truncating the table offers a quick way of clearing all entries we have accrued with no side effects. As opposed to cleaning up the table which adds processing overhead to the database and lag when restarting the scheduler or when it hits its maintenance window. TEST=Ran truncate on the test server and during RPC/Web page accesses to ensure that we do not impact scripts or page loads. BUG=chromium-os:31608 STATUS=Fixed Change-Id: I600284ef231f2a6943d35e7a72e4fed1328475d7 Reviewed-on: https://gerrit.chromium.org/gerrit/30013 Commit-Ready: Scott Zawalski <scottz@chromium.org> Reviewed-by: Scott Zawalski <scottz@chromium.org> Tested-by: Scott Zawalski <scottz@chromium.org> /external/autotest/scheduler/monitor_db_cleanup.py
e0493a4af57c1a73376a7bafaed542c01f588196	15-Nov-2010	Eric Li <ericli@chromium.org>	Merge remote branch 'cros/upstream' into tempbranch BUG= TEST= Review URL: http://codereview.chromium.org/4823005 Change-Id: I5d56f1c10d0fce7f9d7dc3ad727ea52dcb9b2d6c /external/autotest/scheduler/monitor_db_cleanup.py
675bfe7a50b47c1de15248a3894258b312f2123c	19-Feb-2010	jamesren <jamesren@592f7852-d20e-0410-864c-8624ca9c26a4>	Change email of hosts without platforms to a warning in the logs. We don't do anything about this and it is just extra spam for us at this point in time. This will only affect admins who use the SCHEDULER.notify_email variable Signed-off-by: Scott Zawalski <scottz@google.com> git-svn-id: http://test.kernel.org/svn/autotest/trunk@4258 592f7852-d20e-0410-864c-8624ca9c26a4 /external/autotest/scheduler/monitor_db_cleanup.py
be030fbb848ad116769790ee0ed149409652c501	15-Jan-2010	showard <showard@592f7852-d20e-0410-864c-8624ca9c26a4>	In periodic reverificaiton, use schedule_special_task() instead of straight object creation. This is the right path to use for creating tasks -- it include duplication avoidance and automatic owner tagging. Signed-off-by: Steve Howard <showard@google.com> git-svn-id: http://test.kernel.org/svn/autotest/trunk@4120 592f7852-d20e-0410-864c-8624ca9c26a4 /external/autotest/scheduler/monitor_db_cleanup.py
64a9595406f2884fb3ece241190b10aa054439a9	13-Jan-2010	showard <showard@592f7852-d20e-0410-864c-8624ca9c26a4>	When using Django models from a script, make the current user default to an actual database user named "autotest_system". This allows for simpler, more consistent code. Signed-off-by: Steve Howard <showard@google.com> git-svn-id: http://test.kernel.org/svn/autotest/trunk@4114 592f7852-d20e-0410-864c-8624ca9c26a4 /external/autotest/scheduler/monitor_db_cleanup.py
8dbd05aa0a62a0b17bf4b19131250a8f6cfccf02	12-Jan-2010	showard <showard@592f7852-d20e-0410-864c-8624ca9c26a4>	Implement periodic reverification of dead hosts, configurable in global_config. Implemented as part of the periodic cleanup, so the frequency of reverification is bounded by the periodic cleanup interval. I felt this would be acceptable and putting this in the existing cleanup class makes things more nicely organized. Signed-off-by: Steve Howard <showard@google.com> git-svn-id: http://test.kernel.org/svn/autotest/trunk@4100 592f7852-d20e-0410-864c-8624ca9c26a4 /external/autotest/scheduler/monitor_db_cleanup.py
eab66ce582bfe05076ff096c3a044d8f0497bbca	23-Dec-2009	showard <showard@592f7852-d20e-0410-864c-8624ca9c26a4>	Rename the tables in the databases, by prefixing the app name. This is in preparation for merging the two databases and the two Django projects into one. Note that this renames all standard Autotest DB tables in both the autotest_web and tko databases. If you have scripts written directly against these databases, they will break. If your scripts access the RPC interfaces, they should continue to work. Another patch will be along within the next few weeks to actually move the TKO tables into the autotest_web database. From: James Ren <jamesren@google.com> Signed-off-by: Steve Howard <showard@google.com> Rename the tables in the databases, by prefixing the app name. This is in preparation for merging the two databases and the two Django projects into one. Note that this renames all standard Autotest DB tables in both the autotest_web and tko databases. If you have scripts written directly against these databases, they will break. If your scripts access the RPC interfaces, they should continue to work. From: James Ren <jamesren@google.com> Signed-off-by: Steve Howard <showard@google.com> git-svn-id: http://test.kernel.org/svn/autotest/trunk@4040 592f7852-d20e-0410-864c-8624ca9c26a4 /external/autotest/scheduler/monitor_db_cleanup.py
7ca9e01f5ef84af6e4f0649d8291e05ee158e833	10-Nov-2009	showard <showard@592f7852-d20e-0410-864c-8624ca9c26a4>	Remove the synch_job_start_timeout_minutes scheduler "feature" as it is pretty much broken by design as is by being based off of the job create time rather than the time the job's hosts went into Pending. Its not being used so its easier to remove it. Signed-off-by: Gregory Smith <gps@google.com> git-svn-id: http://test.kernel.org/svn/autotest/trunk@3921 592f7852-d20e-0410-864c-8624ca9c26a4 /external/autotest/scheduler/monitor_db_cleanup.py
dabf6cf739c42c5641c89f083ce697617ec82d82	09-Sep-2009	showard <showard@592f7852-d20e-0410-864c-8624ca9c26a4>	It is okay for hosts to have multiple atomic group labels so long as all of them are associated with the same atomic group. Don't whine in this situation. Signed-off-by: Gregory Smith <gps@google.com> git-svn-id: http://test.kernel.org/svn/autotest/trunk@3675 592f7852-d20e-0410-864c-8624ca9c26a4 /external/autotest/scheduler/monitor_db_cleanup.py
1ef218db76c473c28627377d8f50d6e6c6743289	03-Aug-2009	mbligh <mbligh@592f7852-d20e-0410-864c-8624ca9c26a4>	This is the result of a batch reindent.py across our tree. As Martin pointed out, we ought to be more careful and create a pre-svn commit script to avoid inserting trash in the tree, meanwhile, this is a good start to cleanup things Signed-off-by: Lucas Meneghel Rodrigues <lmr@redhat.com> git-svn-id: http://test.kernel.org/svn/autotest/trunk@3487 592f7852-d20e-0410-864c-8624ca9c26a4 /external/autotest/scheduler/monitor_db_cleanup.py
6157c63947d2d628d187a084acb0a48473af1c79	06-Jul-2009	showard <showard@592f7852-d20e-0410-864c-8624ca9c26a4>	Make the scheduler robust to finding a HostQueueEntry with more than one atomic group label. Log a detailed error message and continue rather than bailing out with a SchedulerError. Signed-off-by: Gregory Smith <gps@google.com> git-svn-id: http://test.kernel.org/svn/autotest/trunk@3373 592f7852-d20e-0410-864c-8624ca9c26a4 /external/autotest/scheduler/monitor_db_cleanup.py
01a5167f13d9788c9f359ebba31358e329c98ebc	29-May-2009	showard <showard@592f7852-d20e-0410-864c-8624ca9c26a4>	Have the scheduler check for and sometimes cleanup various DB inconsistencies. * in periodic cleanup, check for relationships to invalidated objects, and remove them (and send notification email) * in 24hr cleanup, check for hosts with != 1 platform, and send notification email Also changed AFE models to have labels remove associations with tests (as dependencies) when deleted (invalidated). Signed-off-by: Steve Howard <showard@google.com> git-svn-id: http://test.kernel.org/svn/autotest/trunk@3185 592f7852-d20e-0410-864c-8624ca9c26a4 /external/autotest/scheduler/monitor_db_cleanup.py
12f3e3212795a539d95973f893ac570e669e3a22	13-May-2009	showard <showard@592f7852-d20e-0410-864c-8624ca9c26a4>	Add job maximum runtime, a new per-job timeout that counts time since the job actually started. * added started_on field to host_queue_entries, so that we could actually compute this timeout * added max_runtime_hrs to jobs, with default in global config, and added option to create_job() RPC * added the usual controls to AFE and the CLI for the new job option * added new max runtime timeout method to * added migration to add new fields and set a safe default max runtime for existing jobs Signed-off-by: Steve Howard <showard@google.com> git-svn-id: http://test.kernel.org/svn/autotest/trunk@3132 592f7852-d20e-0410-864c-8624ca9c26a4 /external/autotest/scheduler/monitor_db_cleanup.py
915958db04ca97d3d5a011383e736a3e2b4e8db3	22-Apr-2009	showard <showard@592f7852-d20e-0410-864c-8624ca9c26a4>	Fix monitor_db_unittest, broken by previous change to refactor cleanup code. Two main things here: * 24hr cleanup was running upon object construction, which meant it was running inadvertently during unit testing. Fixed this with the usual trick of moving that action from the constructor to an initialize() function, which gets called separately in monitor_db and which the unit test avoids. * one of the scheduler unit tests was actually testing cleanup code; change that to call the newly located function. this test should maybe be moved to a separate unit test file for the monitor_db_cleanup module, but I just want to get things working again for now. Signed-off-by: Steve Howard <showard@google.com> git-svn-id: http://test.kernel.org/svn/autotest/trunk@3029 592f7852-d20e-0410-864c-8624ca9c26a4 /external/autotest/scheduler/monitor_db_cleanup.py
f3294cce1590d9c79cb25dcaa18cec0ac08c9b73	08-Apr-2009	mbligh <mbligh@592f7852-d20e-0410-864c-8624ca9c26a4>	Move clean up functions into seperate file/classes Add 24hour clean up run Add django_session clean up Signed-off-by: Scott Zawalski <scottz@google.com> git-svn-id: http://test.kernel.org/svn/autotest/trunk@2979 592f7852-d20e-0410-864c-8624ca9c26a4 /external/autotest/scheduler/monitor_db_cleanup.py