2022-08-31
Kong update w Felipe
hard limit of 120 rules per ASG
- if had done portal would have seen already so need to fix
cluster and cluster telemetry need load balancer
proxy nice to have LB
what about classic LB (sim. to ALB)
all now working
infra running ok
moving sandbox to blue, then nonprod, prod
then I am to commission RDS and the rest follows
should have had dev portal by w/c 29/08
Health
- had 4 sets of APIs in one environment
- deck removing one as deployed the next -> deck-sanity
moving from blue to green
willshould be justdeck dump,deck apply
merge FG PRresolve remaining conflictslook at validation
- what does
workspaces.yamldp_fqdnmean? - nothing.
- invert? what data plane targetting from workspace
- A workspace publishes APIs in 0..n data planes
- what does
commission RDS
draft PR per environment for new workspaces.yaml and get everyone to approveC3
- Platman 1.0
- Garrett promises slides to follow
- no more pause reconciler
- no more continuous reconciliation because don’t know state
- move to event based reconciler, two triggers
- push by team
- push by CE (notified)
- managed node groups
- asg groups managed by AWS
2022-08-19
- CEIP-2173
- GitHub actions implementation done and merged
- The Book
- For those interested this is my WIP on the “Cortex book”: https://GitHub.com/elsevier-centraltechnology/cortex-documentation/pull/3/files
- currently focussing on layout/format and overview section
- if you have any overview worthy content I’d be interested 🙂
- the other question from me: does the content help you understand Cortex?
- For those interested this is my WIP on the “Cortex book”: https://GitHub.com/elsevier-centraltechnology/cortex-documentation/pull/3/files
2022-08-18
- dev duty
- CEIP-2176:workspaces.yaml validation
python3 -c 'import json, sys, yaml; from jsonschema import validate; validate(instance=yaml.safe_load(open("workspaces.yaml")),schema=json.load(open("workspaces.schema.json")))'- first with Tekton
- then with GitHub actions
- Conversation with Len
- Prod Mgmt
- James Coates (DBS) have their own TIO, more directed towards Cortex
- The other BUs have cortex mostly tio-led and hence less keen or able to influence
- Tio cross-bills for DevOps so a good pitch for cortex is to save budget for prod dev’t
- Speak to Irfan about ‘the book’
- Prod Mgmt
2022-08-17
- CEIP-2175: contacts teams and writing blog post
2022-08-16
- many meetings about whether to migrate or recreate clean Kong database
2022-08-10
- CEIP-2120: delete workspace
- new from Kong support:
kong migrations reinitialize-workspace-entity-counters
I think the only way is to do this in postgres: Take a backup+ /docker-entrypoint.sh kong migrations reinitialize-workspace-entity-counters Error: admin_gui_session_conf must be set when admin_gui_auth is enabledselect id from workspaces where name = 'here'; update workspace_entity_counters set count = 0 where workspace_id = 'id_here';
- new from Kong support:
2022-08-09
- CEIP-1995: testing and PR for nonprod/prod
- DevDuty
- CEIP-2120: delete workspace
- Kong decK tool: https://docs.konghq.com/deck/latest/installation/
./deleteWorkspace.sh Test https://infra.kong-nonprod.cortex.elsevier.systems/_api/ <TOKEN>- TOKEN comes from AWS Parameter Store (sandbox)
2022-08-08
- AM: CEIP-2120: working w JB on the deletion of workspaces
- PM: CEIP-1995: working w AN on the resultant Jenkins agent crash
2022-08-03
- CEIP-2130
- what about a k8s crontab?
2022-08-03
- Account compliance dashboard mtg:
- suggestions: JSON schema; pydantic instead of dict.
- https://GitHub.com/elsevier-centraltechnology/core-account-compliance-dashboard
- CEIP-2130
- investigated GitHub Apps example
- local server doesn’t start on Linux
mnt/data/git/GitHub-app-webhook-monitor/vendor/bundle/ruby/3.0.0/gems/mustermann-1.0.3/lib/mustermann/regular.rb:22:in `initialize': wrong number of arguments (given 2, expected 1) (ArgumentError)
- local server doesn’t start on Linux
- investigated GitHub Apps example
from /mnt/data/git/GitHub-app-webhook-monitor/vendor/bundle/ruby/3.0.0/gems/mustermann-1.0.3/lib/mustermann/pattern.rb:59:in new' ``` - smee doesn't work with ZScaler (have to exit not merely turn off) - ruby 2.6 gives this error: ``` /Users/stephensont/git/GitHub-app-template/vendor/bundle/ruby/2.6.0/gems/octokit-4.13.0/lib/octokit/middleware/follow_redirects.rb:14:in module:Middleware’: uninitialized constant Faraday::Error::ClientError (NameError)
Did you mean? Faraday::ClientError
```
and brew installing Ruby 3.1 fails as on Linux
- CONCLUSION: GitHub apps are more trouble than they’re worth
- CEIP-1995
- how far has rollout gone?
- JVM parameters (Jenkins on Java 11)
2022-08-02
- CEIP-2130 Jenkins envelope ok, I can build a GitHub App to monitor failures and do something (retry on a fibonacci delay and / escalate on slack) or argue the case with Matteo or fork the GHF to queue the requests ?
- CEIP-1995 and CEIP-2137: Jenkins outage and memory resource changes
2022-08-01
- dev duty
- ticket for Jenkins
2022-07-26
- catchup
- CEIP-2085
- CEIP-2052: Review doc for Felipe
- CEIP-2091 (Thomas Tran)
- Wiki page edited by Rob L (Health)
- mtg plan
- Review: https://GitHub.com/elsevier-centraltechnology/core-cicd-reference/pull/6/
- terraform control repository is a Elsevier idea or broader?
- this is one of the 3 repositories I was going to document in a blog post
- Planning
- Dev portal: tickets to be agreed w Felipe
- Jenkins: Shim w SQS or whatever (liaise w Felipe)