Re: [jakartaee-tck-dev] [glassfish-dev] Tracking usage data for EE4J wor

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index] [List Home]

Re: [jakartaee-tck-dev] [glassfish-dev] Tracking usage data for EE4J working group CI cloud systems

From: Scott Marlow <smarlow@xxxxxxxxxx>
Date: Wed, 30 Sep 2020 14:11:37 -0400
Delivered-to: jakartaee-tck-dev@xxxxxxxxxxx
List-archive: <https://www.eclipse.org/mailman/private/jakartaee-tck-dev>
List-help: <mailto:jakartaee-tck-dev-request@eclipse.org?subject=help>
List-subscribe: <https://www.eclipse.org/mailman/listinfo/jakartaee-tck-dev>, <mailto:jakartaee-tck-dev-request@eclipse.org?subject=subscribe>
List-unsubscribe: <https://www.eclipse.org/mailman/options/jakartaee-tck-dev>, <mailto:jakartaee-tck-dev-request@eclipse.org?subject=unsubscribe>
User-agent: Mozilla/5.0 (X11; Linux x86_64; rv:68.0) Gecko/20100101 Thunderbird/68.11.0

Here are the average + max Memory/#CpuCores:

avg memory.limit    Max Memory        average cpu limits Max CPU
=====                   ===== ======                    ========
61.58 Gi 378.00 Gi 12.1vCPU 74.7 vCPU
There are some cpu/memory limits in Jenkinsfile(https://github.com/eclipse-ee4j/jakartaee-tck/blob/master/Jenkinsfile#L147),each memory limit is specifying the container/VM memory size (since wedidn't specify the initial memory request setting), so the calculationis something like:
memory usage = 10Gi per VM * number of test groups

CPU core = 2 * number of test groups
The data-capture does give us a high level view of what the containerlevel memory/CPU core usage has been. Quoting from a previous TCK mlconversation (from David Blevins with subject: "Resource PackAllocations & Maximizing Use"):
"
Over all of EE4J we have 105 resource packs paid for that give us atotal of 210 cpu cores and 840 GB RAM. These resource packs arededicated, not elastic. The actual allocation of 105 resource packs isby project. The biggest allocation is 50 resource packs toee4j.jakartaee-tck (this project), the second biggest is 15 resourcepacks to ee4j.glassfish.
The most critical takeaway from the above is we have 50 resource packsdedicated to this project giving us a total of 100 cores and 400GB ramat our disposal 24x7. These 50 are bought and paid for -- we do notsave money if we don't use them.
"
So, the Platform TCK is budgeted to use 100 cores and 400GB ram,however, we haven't used more than 75 CPU cores and 378gb of memory (asper numbers max memory/cpu numbers pasted above).
I think the fundamental question is: can we manage this resource,hence the cost, based on these data?
Imo, I think there is memory/cpu tuning that we could do if there istime to experiment before answers are needed regarding current usageversus what usage could be.

Alwin helped me to create a Platform TCK runner job that can run againstmy github repository. Thanks Alwin!

I created https://github.com/scottmarlow/jakartaee-tck/tree/tuning torepresent changes to improve our memory/cpu tuning.

When we have time to try memory/cpu tuning improvements, we can runtests withhttps://ci.eclipse.org/jakartaee-tck/job/jakartaee-tck-scottmarlowagainst the `tuning` branch. Pull requests are welcome! :-)

So, I think this identifies the `how we can try making improvements toour usage`. I'm also hoping that reducing our memory/cpu usage cantranslate into being able to run more concurrent tests at the same time.

Currently, we also have to avoid starting multiple Platform TCK testruns at the same time or we hit test stability problems (GlassFish won'tstart correctly for some tests).

You are also welcome to review any of the commentary and ask questionsdirectly via the issue.
I asked on https://bugs.eclipse.org/bugs/show_bug.cgi?id=565098 aboutmeasuring usage for a weekend or over a few days.

The answer is that the measuring is always on and can be observed as perlinks mentioned in the bugzilla issue. This will require some dancingas we need to ensure that no other tests are run the same day (untilafter we have noted the usage for the `tuning` test run). This isimportant so that we have a way to compare use of different settings.

I'm not sure of when we will have time to do this testing yet but wouldbe nice to fit it in.


Scott

Follow-Ups:
- Re: [jakartaee-tck-dev] [glassfish-dev] Tracking usage data for EE4J working group CI cloud systems
  - From: Ed Bratt
- Re: [jakartaee-tck-dev] [glassfish-dev] Tracking usage data for EE4J working group CI cloud systems
  - From: Scott Marlow

References:
- [jakartaee-tck-dev] Tracking usage data for EE4J working group CI cloud systems
  - From: Ed Bratt
- Re: [jakartaee-tck-dev] [glassfish-dev] Tracking usage data for EE4J working group CI cloud systems
  - From: Scott Marlow

Prev by Date: Re: [jakartaee-tck-dev] Glassfish nightly build issue
Next by Date: Re: [jakartaee-tck-dev] [glassfish-dev] Tracking usage data for EE4J working group CI cloud systems
Previous by thread: Re: [jakartaee-tck-dev] [glassfish-dev] Tracking usage data for EE4J working group CI cloud systems
Next by thread: Re: [jakartaee-tck-dev] [glassfish-dev] Tracking usage data for EE4J working group CI cloud systems
Index(es):
- Date
- Thread

Breadcrumbs