ayounsi (Arzhel Younsi)
Staff Network SRE

User Details

User Since
Apr 3 2017, 6:23 PM (378 w, 6 d)
Availability
Available
IRC Nick
xionox
LDAP User
Ayounsi
MediaWiki User
AYounsi (WMF) [ Global Accounts ]

Recent Activity

Today

ayounsi moved T369274: Move IP gateways for codfw row C/D vlans to EVPN Anycast GW from Backlog to This quarter on the netops board.
Mon, Jul 8, 8:15 AM · netops, Infrastructure-Foundations, SRE
ayounsi moved T365993: Upgrade EVPN switches Eqiad row E-F to JunOS 22.2 - lsw1-e1-eqiad from Backlog to This quarter on the netops board.
Mon, Jul 8, 7:24 AM · SRE-swift-storage, DBA, Data-Persistence, Infrastructure-Foundations, netops, SRE
ayounsi moved T365995: Upgrade EVPN switches Eqiad row E-F to JunOS 22.2 - lsw1-e3-eqiad from Backlog to This quarter on the netops board.
Mon, Jul 8, 7:24 AM · SRE-swift-storage, DBA, Data-Persistence, Infrastructure-Foundations, netops, SRE
ayounsi moved T365997: Upgrade EVPN switches Eqiad row E-F to JunOS 22.2 -lsw1-f2-eqiad from Backlog to This quarter on the netops board.
Mon, Jul 8, 7:24 AM · SRE-swift-storage, DBA, Data-Persistence, Infrastructure-Foundations, netops, SRE
ayounsi moved T365996: Upgrade EVPN switches Eqiad row E-F to JunOS 22.2 - lsw1-f1-eqiad from Backlog to This quarter on the netops board.
Mon, Jul 8, 7:24 AM · SRE-swift-storage, DBA, Data-Persistence, Infrastructure-Foundations, netops, SRE
ayounsi moved T365998: Upgrade EVPN switches Eqiad row E-F to JunOS 22.2 -lsw1-f3-eqiad from Backlog to This quarter on the netops board.
Mon, Jul 8, 7:24 AM · SRE-swift-storage, DBA, Data-Persistence, Infrastructure-Foundations, netops, SRE
ayounsi closed T363895: BGP status (instance cr2-eqord) - April 2024 - Equinix peering AS15830 as Resolved.
Mon, Jul 8, 7:19 AM · netops, Infrastructure-Foundations
ayounsi moved T369351: Model GRE tunnels in Netbox from Backlog to This quarter on the netops board.
Mon, Jul 8, 7:18 AM · Infrastructure-Foundations, netops, SRE
ayounsi moved T364095: Codfw row C/D switch installation & configuration from Backlog to This quarter on the netops board.
Mon, Jul 8, 7:18 AM · DC-Ops, ops-codfw, netops, Infrastructure-Foundations, SRE
ayounsi moved T362772: ASW single-point of failure for LVS VIPs at POPs from Backlog to Watching on the netops board.
Mon, Jul 8, 7:18 AM · Patch-For-Review, Traffic, SRE, netops, Infrastructure-Foundations
ayounsi moved T360789: codfw row C/D upgrade racking task from Backlog to Watching on the netops board.
Mon, Jul 8, 7:18 AM · SRE, Infrastructure-Foundations, netops, ops-codfw, DC-Ops
ayounsi moved T348977: Upgrade EVPN switches Eqiad row E-F to JunOS 22.2 from Backlog to This quarter on the netops board.
Mon, Jul 8, 7:17 AM · Infrastructure-Foundations, netops, SRE
ayounsi moved T367512: Get test host connected to codfw row c/d lsw's from Backlog to Watching on the netops board.
Mon, Jul 8, 7:03 AM · DC-Ops, ops-codfw, SRE, netops, Infrastructure-Foundations
ayounsi moved T368545: weighted maglev viability for low-traffic services from Backlog to Watching on the netops board.
Mon, Jul 8, 7:00 AM · Infrastructure-Foundations, netops, serviceops, Traffic

Fri, Jul 5

ayounsi added a comment to T362392: Routed Ganeti: Add support for VM BGP.

Enable/test BFD between Ganeti and its VMs

Adding the BFD statement works fine for v4, but on the hypervisor side I don't think it can be added for v6 in the current state of things.

Fri, Jul 5, 8:52 AM · Patch-For-Review, Ganeti
ayounsi added a parent task for T369341: Some VRTS emails to Gmail accounts fail the SPF check: T355764: Ensure that VRTS (ticket.wikimedia.org) adheres to Google's sender guidelines.
Fri, Jul 5, 7:06 AM · Infrastructure-Foundations, collaboration-services, vrts
ayounsi added a subtask for T355764: Ensure that VRTS (ticket.wikimedia.org) adheres to Google's sender guidelines: T369341: Some VRTS emails to Gmail accounts fail the SPF check.
Fri, Jul 5, 7:06 AM · collaboration-services
ayounsi reopened T355764: Ensure that VRTS (ticket.wikimedia.org) adheres to Google's sender guidelines, a subtask of T355712: Ensure Wikimedia complies with Google's new email sender guidelines, as Open.
Fri, Jul 5, 7:06 AM · Foundational Technology Requests
ayounsi reopened T355764: Ensure that VRTS (ticket.wikimedia.org) adheres to Google's sender guidelines as "Open".

Reopening because of T369341: Some VRTS emails to Gmail accounts fail the SPF check

Fri, Jul 5, 7:06 AM · collaboration-services

Thu, Jul 4

ayounsi added a comment to T366193: Anycast ns1.wikimedia.org.

Possible, but complicated and fuzzy as to the effects on different scenarios. If we "weight" a single anycast/24 by loading up several distinct NS IPs from it, it also has an outsized negative impact anytime the unicast fallback was really needed (if the anycast isn't working from this client cache POV). Also, doesn't scale well, as it starts inflating response packet sizes for NS lists (esp in a DNSSEC future), and causing general concerns about amplification.

Thanks, that makes sense. I'm wondering how it would compare with the other proposal in terms of cost/complexity/latency/availability.

Thu, Jul 4, 3:07 PM · SRE, Traffic
ayounsi added a comment to T362392: Routed Ganeti: Add support for VM BGP.

The above patch should work around the issue for v6 (based on @cmooney's testing)

Thu, Jul 4, 1:02 PM · Patch-For-Review, Ganeti
ayounsi closed T367265: Capirca setup for routed Ganeti VMs as Resolved.

Import script fixed and orphaned IP for the now deleted sretest2005 removed.

Thu, Jul 4, 8:27 AM · Patch-For-Review, Infrastructure-Foundations, netops
ayounsi committed rOSNEc1896766f195: PuppetDB import: don't treat /32-/128 VM interfaces as VIPs.
PuppetDB import: don't treat /32-/128 VM interfaces as VIPs
Thu, Jul 4, 8:25 AM
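
The commit above stops the PuppetDB import from treating /32 and /128 addresses on VM interfaces as VIPs. The import script's real logic is not shown here, so the following is only a minimal sketch of that kind of check, assuming host-prefix addresses were previously the VIP signal. The function names and the `is_vm_interface` flag are hypothetical.

```python
import ipaddress


def is_host_prefix(address: str) -> bool:
    """True for a /32 (IPv4) or /128 (IPv6) interface address."""
    iface = ipaddress.ip_interface(address)
    return iface.network.prefixlen == iface.network.max_prefixlen


def looks_like_vip(address: str, is_vm_interface: bool) -> bool:
    """Hypothetical rule: a host-prefix address is a VIP candidate,
    unless it sits on a VM interface (assumed flag, not a real API)."""
    return is_host_prefix(address) and not is_vm_interface


# The /32 on a routed Ganeti tap interface is not a VIP under this rule.
print(looks_like_vip("208.80.152.129/32", is_vm_interface=True))
print(looks_like_vip("208.80.152.129/32", is_vm_interface=False))
```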

Wed, Jul 3

ayounsi closed T362421: magru network setup as Resolved.

All is done here.

Wed, Jul 3, 3:56 PM · Patch-For-Review, netops, SRE, Infrastructure-Foundations
ayounsi closed T362421: magru network setup, a subtask of T346722: Sao Paulo, Brazil, South America POP tracking task, as Resolved.
Wed, Jul 3, 3:53 PM · ops-magru
ayounsi closed T362330: Routed Ganeti : Add support for public IPs as Resolved.
Wed, Jul 3, 10:38 AM · Patch-For-Review, Ganeti
ayounsi added a comment to T362330: Routed Ganeti : Add support for public IPs.

Confirmed working:

install2004
Jul  3 08:47:14 install2004 dhcpd[3728660]: DHCPDISCOVER from aa:00:00:f4:44:8d via 10.192.6.6
Jul  3 08:47:14 install2004 dhcpd[3728660]: DHCPOFFER on 208.80.152.130 to aa:00:00:f4:44:8d via 10.192.6.6
Jul  3 08:47:15 install2004 dhcpd[3728660]: DHCPDISCOVER from aa:00:00:f4:44:8d via 10.192.6.6
Jul  3 08:47:15 install2004 dhcpd[3728660]: DHCPOFFER on 208.80.152.130 to aa:00:00:f4:44:8d via 10.192.6.6
Jul  3 08:47:17 install2004 dhcpd[3728660]: DHCPREQUEST for 208.80.152.130 (208.80.153.105) from aa:00:00:f4:44:8d via 10.192.6.6
Jul  3 08:47:17 install2004 dhcpd[3728660]: DHCPACK on 208.80.152.130 to aa:00:00:f4:44:8d via 10.192.6.6
Jul  3 08:47:18 install2004 atftpd[479]: Serving lpxelinux.0 to 208.80.152.130:13292
ganeti2034
$ ip addr show dev tap1
23: tap1: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc fq_codel state UNKNOWN group default qlen 1000
    link/ether 22:22:22:22:22:01 brd ff:ff:ff:ff:ff:ff
    inet 208.80.152.129/32 scope link tap1
       valid_lft forever preferred_lft forever
    inet6 fe80::2022:22ff:fe22:2201/64 scope link 
       valid_lft forever preferred_lft forever
$ ip route show dev tap1
208.80.152.130 proto static scope link 
Wed, Jul 3, 10:38 AM · Patch-For-Review, Ganeti
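
The install2004 log in the comment above shows the DHCP exchange for the VM. As a rough illustration of how such lines could be checked automatically, here is a minimal sketch that extracts the DHCP message types logged for one MAC and confirms a DHCPACK follows a DHCPREQUEST. The regular expression is written against the log format pasted above only, and the helper names are hypothetical.

```python
import re

# Matches the message type in dhcpd syslog lines such as
# "... dhcpd[3728660]: DHCPACK on 208.80.152.130 to aa:00:00:f4:44:8d via 10.192.6.6"
DHCP_RE = re.compile(r"dhcpd\[\d+\]: (DHCP[A-Z]+) ")


def dhcp_messages(lines, mac):
    """Return the ordered DHCP message types logged for one client MAC."""
    messages = []
    for line in lines:
        if mac not in line:
            continue
        match = DHCP_RE.search(line)
        if match:
            messages.append(match.group(1))
    return messages


def lease_completed(messages):
    """True if a DHCPACK appears at or after the first DHCPREQUEST."""
    if "DHCPREQUEST" not in messages:
        return False
    return "DHCPACK" in messages[messages.index("DHCPREQUEST"):]


LOG = [
    "Jul  3 08:47:15 install2004 dhcpd[3728660]: DHCPDISCOVER from aa:00:00:f4:44:8d via 10.192.6.6",
    "Jul  3 08:47:15 install2004 dhcpd[3728660]: DHCPOFFER on 208.80.152.130 to aa:00:00:f4:44:8d via 10.192.6.6",
    "Jul  3 08:47:17 install2004 dhcpd[3728660]: DHCPREQUEST for 208.80.152.130 (208.80.153.105) from aa:00:00:f4:44:8d via 10.192.6.6",
    "Jul  3 08:47:17 install2004 dhcpd[3728660]: DHCPACK on 208.80.152.130 to aa:00:00:f4:44:8d via 10.192.6.6",
    "Jul  3 08:47:18 install2004 atftpd[479]: Serving lpxelinux.0 to 208.80.152.130:13292",
]

print(lease_completed(dhcp_messages(LOG, "aa:00:00:f4:44:8d")))
```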
ayounsi updated the task description for T369136: Package ipxe-qemu.
Wed, Jul 3, 8:31 AM · User-Elukey, Packaging, Infrastructure-Foundations
ayounsi created T369136: Package ipxe-qemu.
Wed, Jul 3, 8:29 AM · User-Elukey, Packaging, Infrastructure-Foundations
ayounsi updated the task description for T362330: Routed Ganeti : Add support for public IPs.
Wed, Jul 3, 5:58 AM · Patch-For-Review, Ganeti
ayounsi updated the task description for T362330: Routed Ganeti : Add support for public IPs.
Wed, Jul 3, 5:57 AM · Patch-For-Review, Ganeti

Mon, Jul 1

ayounsi closed T368499: magru ipv6 issues as Resolved.
Mon, Jul 1, 2:37 PM · Infrastructure-Foundations, netops

Fri, Jun 28

ayounsi added a comment to T367973: Replace ping offload servers with eBPF.

Indeed, amazing ! Just a few lines of code to replace multiple VMs and router policies :)

Fri, Jun 28, 7:18 AM · Traffic
ayounsi added a comment to T368544: IPIP encapsulation considerations for low-traffic services.

IPIP encapsulation is a necessary step in the right direction, whatever solution we decide on for load balancing, for the reasons mentioned by Cathal and Valentin. As a data point, the vxlan license is an extra 100k for a 10-rack setup (plus yearly support).

Fri, Jun 28, 7:13 AM · Infrastructure-Foundations, serviceops, netops, Traffic

Thu, Jun 27

ayounsi added a comment to T368545: weighted maglev viability for low-traffic services.

Strictly on the network side, there is no blocker one way or the other.

Thu, Jun 27, 7:17 AM · Infrastructure-Foundations, netops, serviceops, Traffic

Wed, Jun 26

ayounsi committed rOSNEa71d05fc4991: Netbox puppet import: ignore ipip interfaces.
Netbox puppet import: ignore ipip interfaces
Wed, Jun 26, 6:49 AM
ayounsi committed rOSNE588cc7c85ca5: Netbox puppet import: ignore ipip interfaces.
Netbox puppet import: ignore ipip interfaces
Wed, Jun 26, 6:47 AM
ayounsi created T368499: magru ipv6 issues.
Wed, Jun 26, 6:31 AM · Infrastructure-Foundations, netops

Tue, Jun 25

ayounsi added a comment to T367439: No unicast IP ranges announced to peers from eqdfw.

Your proposal seems good to me.

Tue, Jun 25, 1:15 PM · Patch-For-Review, Infrastructure-Foundations, netops, SRE
ayounsi committed rOSNEa992a10d9059: move_server.py: fix function call name (authored by elukey).
move_server.py: fix function call name
Tue, Jun 25, 8:30 AM
ayounsi committed rOSNE2f02e16a7e03: Netbox 4: rename device_role to role in validators.
Netbox 4: rename device_role to role in validators
Tue, Jun 25, 8:19 AM
ayounsi committed rOSNE04e3d5045c17: Netbox 4: fix cable terminations breaking changes.
Netbox 4: fix cable terminations breaking changes
Tue, Jun 25, 8:19 AM

Mon, Jun 24

ayounsi committed rOSNEc6c39bcfd960: Netbox 4: fix breaking change around connected_endpoints.
Netbox 4: fix breaking change around connected_endpoints
Mon, Jun 24, 8:48 PM
ayounsi committed rOSNEa2431147c586: Netbox 4: scripts self.log renamed self.messages.
Netbox 4: scripts self.log renamed self.messages
Mon, Jun 24, 4:08 PM
ayounsi added a comment to T336275: Upgrade Netbox to 4.x.

Opened https://github.com/netbox-community/netbox/issues/16698 for a Netbox regression on how it handles Scripts compared to... 3.2.9

Mon, Jun 24, 1:56 PM · Patch-For-Review, Infrastructure-Foundations, netbox
ayounsi committed rOSNE76c69b2f324a: Netbox 4: fix ColorChoices import.
Netbox 4: fix ColorChoices import
Mon, Jun 24, 8:46 AM

Fri, Jun 21

ayounsi created P65323 Netbox 4.0.3 Jobs page error.
Fri, Jun 21, 3:35 PM
ayounsi committed rOSNE8577f6e3766a: Netbox 4: replace `device_role` with `role`.
Netbox 4: replace `device_role` with `role`
Fri, Jun 21, 2:59 PM
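
The commit above replaces `device_role` with `role` for Netbox 4. For code that may still meet objects exposing the old attribute name, a small compatibility accessor is one option. This is only a sketch: `get_role` is a hypothetical helper, not part of netbox-extras, and the objects below are stand-ins rather than real Netbox models.

```python
from types import SimpleNamespace


def get_role(device):
    """Return the device's role, preferring the new `role` attribute
    and falling back to the old `device_role` name if needed."""
    role = getattr(device, "role", None)
    if role is None:
        role = getattr(device, "device_role", None)
    return role


# Stand-in objects for illustration only.
new_style = SimpleNamespace(role="server")
old_style = SimpleNamespace(device_role="asw")
print(get_role(new_style), get_role(old_style))
```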
ayounsi updated subscribers of T336275: Upgrade Netbox to 4.x.

It's not possible to do the DB migration directly from 3.2.9 to 4.x. We need to do a pit-stop on 3.7.x.

Fri, Jun 21, 1:21 PM · Patch-For-Review, Infrastructure-Foundations, netbox
ayounsi updated the task description for T336275: Upgrade Netbox to 4.x.
Fri, Jun 21, 9:15 AM · Patch-For-Review, Infrastructure-Foundations, netbox
ayounsi committed rOSNEc16161e0fb06: Netbox 4: getstats.py.
Netbox 4: getstats.py
Fri, Jun 21, 9:03 AM

Thu, Jun 20

ayounsi committed rOSNE3391d63fe21c: Netbox-extra: Add bandit and prospector to CI.
Netbox-extra: Add bandit and prospector to CI
Thu, Jun 20, 11:45 AM
ayounsi committed rOSNEd50e85c312c5: Fix lots of CI errors.
Fix lots of CI errors
Thu, Jun 20, 11:40 AM
ayounsi updated the task description for T336275: Upgrade Netbox to 4.x.
Thu, Jun 20, 8:00 AM · Patch-For-Review, Infrastructure-Foundations, netbox

Wed, Jun 19

ayounsi added a comment to T336275: Upgrade Netbox to 4.x.

Some notes before I forget, to make the sre.deploy.python-code work I had to:

Wed, Jun 19, 4:43 PM · Patch-For-Review, Infrastructure-Foundations, netbox
ayounsi created T367973: Replace ping offload servers with eBPF.
Wed, Jun 19, 1:12 PM · Traffic
Syaifulnizamshamsudin awarded Blog Post: Ganeti on modern network design a Manufacturing Defect? token.
Wed, Jun 19, 12:47 PM

Mon, Jun 17

ayounsi removed projects from T367056: Rise in ms-fe2* TCP retransmits since 11:40 UTC today : Infrastructure-Foundations, netops.

We had a quick look at the network side and couldn't find any smoking gun.

Mon, Jun 17, 3:19 PM · Traffic, SRE, SRE-swift-storage
ayounsi claimed T367265: Capirca setup for routed Ganeti VMs.
Mon, Jun 17, 3:05 PM · Patch-For-Review, Infrastructure-Foundations, netops
ayounsi triaged T367731: drmrs/esams/magru LVS : remove cross-rack links as Low priority.
Mon, Jun 17, 3:05 PM · netops, Traffic, Infrastructure-Foundations
ayounsi triaged T367732: POPs LVS : remove public vlan trunking as Low priority.
Mon, Jun 17, 3:04 PM · netops, Traffic, Infrastructure-Foundations
ayounsi added a comment to T367731: drmrs/esams/magru LVS : remove cross-rack links.

Of course ! not planning on doing it today :) The task is there to not forget.

Mon, Jun 17, 1:49 PM · netops, Traffic, Infrastructure-Foundations
ayounsi created T367732: POPs LVS : remove public vlan trunking.
Mon, Jun 17, 11:13 AM · netops, Traffic, Infrastructure-Foundations
ayounsi created T367731: drmrs/esams/magru LVS : remove cross-rack links.
Mon, Jun 17, 11:05 AM · netops, Traffic, Infrastructure-Foundations
ayounsi added a comment to T250415: Homer: add parallelization support.

Yeah, I think that's what I meant by:

We can also decide that batch means silently skipping any device that has a different diff, so as not to risk blocking the run midway if a device has local changes.

Basically, decide whether the batch behavior is (3) or (4) and then stick to it. 4 options seem a bit too many.
I tend to prefer (3), and would be OK with not supporting (4), especially as in a good state there should be no local changes.

Mon, Jun 17, 9:06 AM · User-Elukey, Infrastructure-Foundations, SRE-tools, homer
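
The batch behaviour quoted in the T250415 comment above (silently skip any device whose diff differs) could look roughly like the sketch below. It is not Homer code: `plan_batch`, its arguments and the device names are hypothetical, and diffs are assumed to be plain strings already computed for every target device.

```python
def plan_batch(diffs, reference_diff):
    """Split devices into those whose diff matches the reference diff
    (safe to commit as one batch) and those silently skipped."""
    to_apply, skipped = [], []
    for device, diff in sorted(diffs.items()):
        if diff == reference_diff:
            to_apply.append(device)
        else:
            # A different diff suggests local changes: skip rather than block the run.
            skipped.append(device)
    return to_apply, skipped


diffs = {
    "switch-a": "+ set system ntp server 192.0.2.1",
    "switch-b": "+ set system ntp server 192.0.2.1",
    "switch-c": "+ set system ntp server 192.0.2.1\n- delete system services telnet",
}
to_apply, skipped = plan_batch(diffs, "+ set system ntp server 192.0.2.1")
print(to_apply, skipped)
```

Skipped devices would still need to be reported at the end of the run so that their local changes can be reviewed.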
ayounsi added a comment to T366205: codfw:(3) wikikube-ctrl NIC upgrade to 10G.

I don't understand why they need to be moved to get upgraded to 10G. If we take wikikube-ctrl2001 for example, the switch in rack B6 has plenty of available/ready-to-use 10G ports (for example 44-47).

Mon, Jun 17, 9:01 AM · SRE-OnFire, Sustainability (Incident Followup), serviceops, ops-codfw, DC-Ops
ayounsi added a comment to T367408: Should we channelize unused QSFP28 ports on QFX5120s to provide 'buffer' for 10G upgrades?.

Can we move the cables instead of moving the servers ?

Mon, Jun 17, 8:54 AM · netops, Infrastructure-Foundations, SRE
ayounsi added a comment to T250415: Homer: add parallelization support.

It's necessary to do the diff on all target devices anyway, so that behavior is fine.

Mon, Jun 17, 8:36 AM · User-Elukey, Infrastructure-Foundations, SRE-tools, homer

Jun 7 2024

ayounsi updated the task description for T336275: Upgrade Netbox to 4.x.
Jun 7 2024, 8:35 AM · Patch-For-Review, Infrastructure-Foundations, netbox
ayounsi updated the task description for T366874: Netbox: accounting report failure.
Jun 7 2024, 6:49 AM · Infrastructure-Foundations, netbox
ayounsi created T366874: Netbox: accounting report failure.
Jun 7 2024, 6:15 AM · Infrastructure-Foundations, netbox
ayounsi triaged T366864: cr2-eqdfw: PEM 0 Input Voltage Out Of Range as High priority.
Jun 7 2024, 4:53 AM · SRE, ops-eqdfw, DC-Ops

Jun 6 2024

ayounsi updated the task description for T336275: Upgrade Netbox to 4.x.
Jun 6 2024, 8:21 AM · Patch-For-Review, Infrastructure-Foundations, netbox
ayounsi updated the task description for T336275: Upgrade Netbox to 4.x.
Jun 6 2024, 8:14 AM · Patch-For-Review, Infrastructure-Foundations, netbox

Jun 5 2024

ayounsi added a comment to T336275: Upgrade Netbox to 4.x.

Plan so far is to merge https://gerrit.wikimedia.org/r/1037784 to be able to have a puppetized test server compatible with the new deploy directory scheme (netbox-dev)
Then to merge https://gerrit.wikimedia.org/r/1038694 and check it out from /srv/deployment/netbox-dev/deploy
Then load a copy of the prod Netbox DB on the dev instance pbsql
Then run the deploy python code cookbook to have a working Netbox 4 setup (and fix any issue that could prevent it)
Then check if the DB migration went well
In parallel merge https://gerrit.wikimedia.org/r/c/operations/software/netbox-extras/+/905570/ and parent change to have better CI on netbox-extra ahead of fixing all the Netbox 4 breaking changes.
Then send/merge patches to fix those netbox-extra changes.
Then (non-blocker) update the sre.netbox.update-extras cookbook to account for those changes.
Then send Spicerack, Cookbooks and Homer patches to fix Netbox's breaking changes. Ideally by moving some of the Cookbook's Netbox API calls to Spicerack.

Jun 5 2024, 2:09 PM · Patch-For-Review, Infrastructure-Foundations, netbox

Jun 3 2024

ayounsi added a comment to T366360: Anycast NTP and update the list of timeservers for P:systemd::timesyncd.

Last time we rolled out this change, it was simply updating modules/install_server/files/autoinstall/common.cfg. Do you have any other place in mind where this might need to be reconfigured? I am personally for removing this completely but it's not a big deal and we can keep it around as well.

https://wikitech.wikimedia.org/wiki/SRE/Dc-operations/Platform-specific_documentation/Opengear_Serial_Consoles
https://wikitech.wikimedia.org/wiki/SRE/Dc-operations/Platform-specific_documentation/ServerTech

Jun 3 2024, 3:45 PM · SRE, Traffic
ayounsi closed T362523: Juniper: use export-format state-data json compact as Resolved.

Our engineering team has now indicated that the compact json is not supported, due to hardware limitations with respect to compact json formatting. The feature will be deprecated in Junos 24.4. So, please do not use compact json to export data.

Jun 3 2024, 1:30 PM · Infrastructure-Foundations, netops
ayounsi added a comment to T366360: Anycast NTP and update the list of timeservers for P:systemd::timesyncd.

Moving the dynamic nature of NTP definition to some automated system instead of human or Puppet is a great idea :)
Human as in right now for network devices, the list is hard-coded https://github.com/wikimedia/operations-homer-public/blob/master/config/common.yaml#L365

Jun 3 2024, 12:11 PM · SRE, Traffic
ayounsi added a comment to T366193: Anycast ns1.wikimedia.org.
  • i.e. one in Germany which will pick ns0 rather than lower latency ns2

Seems like the main one is adguard-dns.com, which picks them randomly.
https://w.wiki/AGmr

Jun 3 2024, 8:09 AM · SRE, Traffic
ayounsi added a comment to T366193: Anycast ns1.wikimedia.org.

I think the difficult part is knowing where to stop the overengineering. For example, it could make sense to use Liberica to healthcheck/advertise one of the NS anycast IPs, but it might not be worth using a different AuthDNS software on half the servers, or a different Puppet infra.
Before going full anycast we need to make sure we're covering all major failure scenarios, or alternatively make a call to keep some unicast, knowing that some places with broken/dumb implementations won't be the fastest, but that may be an OK tradeoff for better resiliency.

Jun 3 2024, 7:43 AM · SRE, Traffic

May 31 2024

ayounsi added a comment to T366193: Anycast ns1.wikimedia.org.

It's quite interesting to see the variation of tradeoffs, and it can be quite a (significant) rabbit hole. Is the goal to figure it out before anycasting ns1, or to first anycast ns1 from anywhere and then figure out how to modify the setup for possibly better redundancy?
It could be useful to list all the failure scenarios, and whether we need to mitigate them or not (server or network misconfig, Bird bug, etc.). In other words, are we putting too many of our eggs in the same basket?

May 31 2024, 8:04 AM · SRE, Traffic

May 30 2024

ayounsi added a comment to T366193: Anycast ns1.wikimedia.org.

assign a /24 from https://netbox.wikimedia.org/ipam/aggregates/ to be used for this

As we couldn't get a /24 from LACNIC for magru, we only have two free /24s
We have to decide between a few options:

  1. Allocate a whole new /24 for ns1 right now
    • Pro: quick turnaround, no added cost
    • Con: risk of lacking public v4 IPs for future projects (e.g. new POPs) or core site growth; can be mitigated by applying for more prefixes in parallel, but there is no guarantee of getting them
  2. Apply (and pay) for more IPs at RIPE or ARIN (T288342) and wait for an allocation before anycasting ns1
    • Pro: limited cost, more flexible on IP usage
    • Con: long turnaround (months to years)
  3. Buy a /24 on resale market
    • Pro: faster turnaround
    • Con: higher cost
  4. Don't Anycast ns1
    • Listing it only for the sake of completeness, but not preferred. Even though there is a diminishing return after anycasting ns2, we believe anycasting one more ns would bring performance improvements to users
  5. Use the DoH Anycast prefix for ns1
    • Pro: quick turnaround, no added cost
    • Con: risk of providers blocking ns1 as a side effect of blocking DoH, mitigated by having 2 other NS.
May 30 2024, 10:36 AM · SRE, Traffic
ayounsi closed Unknown Object (Task), a subtask of T346722: Sao Paulo, Brazil, South America POP tracking task, as Resolved.
May 30 2024, 7:29 AM · ops-magru

May 28 2024

ayounsi added a comment to T365687: Improve calico-typha firewall rules.

The Typha firewall service is now based on firewall::service and does dynamic name resolution on the puppet server side, let's see if this improves things with the next rename.

The issue didn't happen again, but we also did the move vlan in addition to the rename (so the IP changed too).

May 28 2024, 5:28 PM · serviceops, Prod-Kubernetes, Kubernetes

May 27 2024

ayounsi claimed T365697: Arelion IPv6 transit renumbering.
May 27 2024, 2:27 PM · Patch-For-Review, Infrastructure-Foundations, netops
ayounsi created P63281 move-vlan dry-run error.
May 27 2024, 9:12 AM
ayounsi added a comment to T362523: Juniper: use export-format state-data json compact.

JTAC was able to confirm/duplicate the bug on 22.3R3-S2.4, they're escalating it to their engineering team.

May 27 2024, 7:35 AM · netops, Infrastructure-Foundations

May 24 2024

ayounsi added a comment to T362523: Juniper: use export-format state-data json compact.

Opened JTAC case 2024-0524-163553

May 24 2024, 7:52 AM · netops, Infrastructure-Foundations

May 23 2024

ayounsi added a comment to T355750: CFSSL gencert "remote error: tls: certificate require".

sudo cookbook sre.network.tls --system lsw1-f8-eqiad

May 23 2024, 2:24 PM · CFSSL-PKI, Infrastructure-Foundations
ayounsi added a comment to T365571: Rename wikikube worker nodes during OS reimage.

Before I forget, please notify DCops so they update the physical labels on the server.

May 23 2024, 12:47 PM · Kubernetes, Prod-Kubernetes, serviceops
ayounsi updated the task description for T365697: Arelion IPv6 transit renumbering.
May 23 2024, 12:19 PM · Patch-For-Review, Infrastructure-Foundations, netops
ayounsi created T365697: Arelion IPv6 transit renumbering.
May 23 2024, 12:12 PM · Patch-For-Review, Infrastructure-Foundations, netops
ayounsi triaged T365694: Cookbooks: move Netbox IP allocation to spicerack module as Low priority.
May 23 2024, 11:56 AM · Infrastructure-Foundations, SRE-tools, netbox, Spicerack
ayounsi triaged T365680: Redfish _get_dummy_response() should return empty json as Low priority.
May 23 2024, 8:49 AM · Infrastructure-Foundations, SRE-tools, Spicerack

May 22 2024

ayounsi added a comment to T365571: Rename wikikube worker nodes during OS reimage.

Sounds good! I'd recommend first doing a rename followed by a normal re-image, then just a move-vlan, then, on a different host, testing both actions one after the other.

May 22 2024, 1:58 PM · Kubernetes, Prod-Kubernetes, serviceops
ayounsi added a comment to T362523: Juniper: use export-format state-data json compact.

Tested on an MX204 running Junos 21.2 and 22.4R3.25, the returned JSON is invalid...

May 22 2024, 1:13 PM · netops, Infrastructure-Foundations
ayounsi added a comment to P62883 cr3-ulsfo> show interfaces xe-0/1/2 | display json - list not being defined properly.

Diff:

@@ -100,7 +100,7 @@
                 {
                 }, 
                 "address-family" :
-                {
+                [{
                     "address-family-name" : "inet", 
                     "mtu" : "4456", 
                     "max-local-cache" : "100000", 
@@ -142,7 +142,7 @@
                         "internal-flags" : "0x0"
                     }, 
                     "interface-address" :
-                    {
+                    [{
                         "ifa-flags" :
                         {
                             "ifaf-current-preferred" : "[null]", 
@@ -174,6 +174,7 @@
                             }
                         }
                     }
+                    ]
                 }, 
                 {
                     "address-family-name" : "multiservice", 
@@ -182,9 +183,8 @@
                     {
                         "internal-flags" : "0x0"
                     }
-                }
+                }]
             }
         }
     }
 }
May 22 2024, 1:02 PM
ayounsi added a comment to P62883 cr3-ulsfo> show interfaces xe-0/1/2 | display json - list not being defined properly.

Valid:

{
    "interface-information" :
    {
        "physical-interface" :
        {
            "name" : "xe-0/1/2", 
            "admin-status" : "up", 
            "oper-status" : "up", 
            "local-index" : "164", 
            "snmp-index" : "536", 
            "description" : "Transit: Arelion (IC-) {#1071}", 
            "link-level-type" : "Ethernet", 
            "sonet-mode" : "LAN-PHY", 
            "mtu" : "4470", 
            "mru" : "4478", 
            "source-filtering" : "disabled", 
            "speed" : "10Gbps", 
            "bpdu-error" : "none", 
            "ld-pdu-error" : "none", 
            "l2pt-error" : "none", 
            "loopback" : "none", 
            "if-flow-control" : "enabled", 
            "if-speed-cfg" : "Auto", 
            "pad-to-minimum-frame-size" : "Disabled", 
            "if-device-flags" :
            {
                "ifdf-present" : "[null]", 
                "ifdf-running" : "[null]"
            }, 
            "ifd-specific-config-flags" :
            {                           
                "internal-flags" : "0x100200"
            }, 
            "if-config-flags" :
            {
                "iff-snmp-traps" : "[null]", 
                "internal-flags" : "0x4000"
            }, 
            "if-media-flags" :
            {
                "ifmf-none" : "[null]"
            }, 
            "physical-interface-cos-information" :
            {
                "physical-interface-cos-hw-max-queues" : "8", 
                "physical-interface-cos-use-max-queues" : "8", 
                "physical-interface-schedulers" : "0"
            }, 
            "current-physical-address" : "f0:4b:3a:ef:7e:45", 
            "hardware-physical-address" : "f0:4b:3a:ef:7e:45", 
            "interface-flapped" : "2023-03-09 08:13:40 UTC (62w6d 04:14 ago)", 
            "traffic-statistics" :
            {
                "input-bps" : "7184824", 
                "input-pps" : "8233", 
                "output-bps" : "85449600", 
                "output-pps" : "8470"
            }, 
            "active-alarms" :
            {
                "interface-alarms" :    
                {
                    "alarm-not-present" : "[null]"
                }
            }, 
            "active-defects" :
            {
                "interface-alarms" :
                {
                    "alarm-not-present" : "[null]"
                }
            }, 
            "ethernet-pcs-statistics" :
            {
                "bit-error-seconds" : "3", 
                "errored-blocks-seconds" : "3"
            }, 
            "interface-transmit-statistics" : "Disabled", 
            "logical-interface" :
            {
                "name" : "xe-0/1/2.0", 
                "local-index" : "343", 
                "snmp-index" : "555", 
                "if-config-flags" :
                {
                    "iff-up" : "[null]", 
                    "iff-snmp-traps" : "[null]", 
                    "internal-flags" : "0x4004000"
                }, 
                "encapsulation" : "ENET2", 
                "policer-overhead" :    
                {
                }, 
                "traffic-statistics" :
                {
                    "input-packets" : "1184552371726", 
                    "output-packets" : "1206955771514"
                }, 
                "filter-information" :
                {
                }, 
                "address-family" :
                [{
                    "address-family-name" : "inet", 
                    "mtu" : "4456", 
                    "max-local-cache" : "100000", 
                    "new-hold-limit" : "100000", 
                    "intf-curr-cnt" : "1", 
                    "intf-unresolved-cnt" : "0", 
                    "intf-dropcnt" : "0", 
                    "address-family-flags" :
                    {
                        "ifff-rpf-check" : "[null]", 
                        "ifff-rpf-loose-mode" : "[null]", 
                        "ifff-sendbcast-pkt-to-re" : "[null]", 
                        "internal-flags" : "0x0"
                    }, 
                    "interface-address" :
                    {
                        "ifa-flags" :
                        {               
                            "ifaf-current-preferred" : "[null]", 
                            "ifaf-current-primary" : "[null]"
                        }, 
                        "ifa-destination" : "80.239.192.64/30", 
                        "ifa-local" : "80.239.192.66", 
                        "ifa-broadcast" : "80.239.192.67"
                    }
                }, 
                {
                    "address-family-name" : "inet6", 
                    "mtu" : "4456", 
                    "max-local-cache" : "75000", 
                    "new-hold-limit" : "75000", 
                    "intf-curr-cnt" : "2", 
                    "intf-unresolved-cnt" : "0", 
                    "intf-dropcnt" : "0", 
                    "address-family-flags" :
                    {
                        "ifff-rpf-check" : "[null]", 
                        "ifff-rpf-loose-mode" : "[null]", 
                        "internal-flags" : "0x0"
                    }, 
                    "interface-address" :
                    [{
                        "ifa-flags" :
                        {
                            "ifaf-current-preferred" : "[null]", 
                            "ifaf-current-primary" : "[null]"
                        }, 
                        "ifa-destination" : "2001:2000:3080:a9a::/64", 
                        "ifa-local" : "2001:2000:3080:a9a::2", 
                        "interface-address" :
                        {
                            "in6-addr-flags" :
                            {
                                "ifaf-none" : "[null]"
                            }
                        }
                    }, 
                    {
                        "ifa-flags" :
                        {
                            "ifaf-current-preferred" : "[null]", 
                            "internal-flags" : "0x800"
                        }, 
                        "ifa-destination" : "fe80::/64", 
                        "ifa-local" : "fe80::f24b:3aff:feef:7e45", 
                        "interface-address" :
                        {
                            "in6-addr-flags" :
                            {
                                "ifaf-none" : "[null]"
                            }
                        }
                    }
                    ]
                }, 
                {
                    "address-family-name" : "multiservice", 
                    "mtu" : "Unlimited", 
                    "address-family-flags" :
                    {
                        "internal-flags" : "0x0"
                    }
                }]
            }
        }
    }
}
May 22 2024, 1:02 PM
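
In the valid output above, the same key can hold either a single object or a list: `interface-address` is an object for inet and a list for inet6. A consumer of this JSON therefore has to handle both shapes. The following is a minimal sketch of one way to do so; `as_list` and `local_addresses` are hypothetical helper names, and the sample is a heavily trimmed stand-in for the paste, not real device output.

```python
def as_list(value):
    """Normalise a value that may be missing, a single object, or a list."""
    if value is None:
        return []
    return value if isinstance(value, list) else [value]


def local_addresses(doc):
    """Collect every `ifa-local` value, whatever shape each level has."""
    found = []
    physical = doc["interface-information"]["physical-interface"]
    for phys in as_list(physical):
        for logical in as_list(phys.get("logical-interface")):
            for family in as_list(logical.get("address-family")):
                for addr in as_list(family.get("interface-address")):
                    if "ifa-local" in addr:
                        found.append(addr["ifa-local"])
    return found


# Trimmed sample following the structure of the paste above.
SAMPLE = {
    "interface-information": {
        "physical-interface": {
            "name": "xe-0/1/2",
            "logical-interface": {
                "name": "xe-0/1/2.0",
                "address-family": [
                    {"address-family-name": "inet",
                     "interface-address": {"ifa-local": "80.239.192.66"}},
                    {"address-family-name": "inet6",
                     "interface-address": [
                         {"ifa-local": "2001:2000:3080:a9a::2"},
                         {"ifa-local": "fe80::f24b:3aff:feef:7e45"},
                     ]},
                    {"address-family-name": "multiservice"},
                ],
            },
        }
    }
}

print(local_addresses(SAMPLE))
```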
ayounsi updated the title for P62883 cr3-ulsfo> show interfaces xe-0/1/2 | display json - list not being defined properly from untitled to cr3-ulsfo> show interfaces xe-0/1/2 | display json - list not being defined properly.
May 22 2024, 12:45 PM
ayounsi created P62883 cr3-ulsfo> show interfaces xe-0/1/2 | display json - list not being defined properly.
May 22 2024, 12:35 PM