Resolved Issues
This section lists the issues fixed in the Junos OS main release and the maintenance releases.
For the most complete and latest information about known Junos OS defects, use the Juniper Networks online Junos Problem Report Search application.
Resolved Issues: 15.1R7
Application Layer Gateways (ALGs)
On MX Series routers, when the RTSP ALG is enabled, a certain crafted RTSP interleave data packet might cause the flowd process to crash. Repeated crash of the flowd process constitutes an extended denial-of-service condition for the MX Series routers. [For more information, see http://kb.juniper.net/JSA10721]. PR1116559
An IPsec VPN connection cannot be established successfully, because the Internet Key Exchange (IKE) ALG drops the first response message during the IPsec IKEv2 negotiation. PR1300448
Authentication and Access Control
Malicious LLDP crafted packet leads to privilege escalation and denial of service (CVE-2018-0007). For more information, see https://kb.juniper.net/JSA10830. PR1343600
Class of Service (CoS)
If the hidden command show class-of-service queue-consumption is executed many times (in this case, for 100 times), in a rare condition, the cosd process might crash with a core file generated. The core files could be seen by executing the CLI command show system core-dumps. PR1066009
When CoS is configured, in a very rare situation, because of the timing issue between dcd and cosd during commit, the cosd might crash. For example, if you delete an interface that belongs to an aggregated Ethernet interface and then configure it as a single port with CoS in a single commit, this issue might occur. PR1220524
A forwarding class might be missed in the output of the show class-of-service scheduler-hierarchy interface command. PR1281523
EVPN
In an EVPN scenario with static MAC configured in the EVPN instance, the remote EVPN instance can see the MAC route information. However, after deactivating and activating static MAC in the EVPN instance, and then checking the MAC route information in the remote EVPN instance, no such MAC route is found in the EVPN route table. PR1193754
On MX Series routers with EVPN, the routing protocol process might crash when MAC moves between multihomed PE routers, resulting in traffic loss. PR1216144
In an EVPN all-active multihoming scenario, when you create and roll back an EVPN table, Layer 2 loop and traffic loss occurs. The routing protocol process (rpd) sends a MAC address for a Layer 2 address learning process on creation and a Remote-To-Local-Adv-Done flag. After this point, there is no withdrawal sent for this MAC from the rpd due to a mismatch in a cpmac tree. PR1226436
In an EVPN-MPLS or EVPN-VXLAN environment, if the subinterface is configured with VLAN-aware (instance-type virtual-switch), in a rare condition, the FPC or MPC might crash. PR1274976
Ethernet A-D per Ethernet segment route (Type-1 PER ES) is not generated with a new route target after changing the route target. PR1279529
In a Junos OS platform, the l2ald daemon might crash when MAC address is processing. The MAC learning process will be impacted during the period of l2ald crash. The l2ald recovers itself. PR1347606
Forwarding and Sampling
With l2tp subscribers, after a subscriber's login attempt, all FPCs except the card that hosts subscribers might report the following log message: jnh_if_get_input_feature_list(9723): Could not find ifl state. PR1140527
In an accounting scenario, due to a software defect or a - limit of maximum parallel transferred files, only nine accounting files can be transferred from the Packet Forwarding Engine process resulting in missing accounting files. PR1153708
The firewall module (daemon dfwd) on the Routing Engine always leaks some memory upon configuration commit with the following configurations: set routing-options forwarding-table export qos3, set policy-options policy-statement <policy name> term 1 from source-address-filter <ip-address>, and set policy-options policy-statement <policy-name> term 1 then forwarding-class <forwarding-class>. PR1157714
In a subscriber management environment, the size of the statistics database (and corresponding size of /mfs partition) might constantly increase because of the absence of statistics entry cleanup for certain types of subscribers in a few scenarios. This issue is likely to occur if VLAN-OOB subscribers are present, or if dynamic authenticated VLANs are removed due to expiration of session-timeout. PR1251756
When the statistics about Packet Forwarding Engine PEER send or receive counters get wrapped around during a hearbeat scenario when the system runs for a long time, the Packet Forwarding Engine process might crash. PR1266025
Error messages such as SNMP_EVLIB_FAILURE: PFED ran out of transfer credits with PFE.Failed to get stats. ifl index are seen in syslog. PR1270686
With Routing Engine-based sampling configured, it might be observed that the chassis stops exporting flow records after every 5-7 days. PR1270723
When the firewall filter is configured with a wildcard (*.*) (such as "from interface ge-*.*") as matching condition, the filter might be incorrectly programmed into the Packet Forwarding Engine, then the firewall filter might not work. PR1274507
In some circumstances, the traffic is still forwarded out of nonphysical interfaces such as gr-/ae interfaces even after the nonphysical interfaces are disabled. Once the MAC address is aged out, the traffic stops. PR1277697
When the FPCs are busy in high churn scenarios, because the srrd thread in the Packet Forwarding Engine has low priority, CPR resources are insufficient to process the messages sent by the srrd process. Due to this, the queue for these busy FPCs is piling up in the srrd and eventually leading to a crash. PR1284918
The sampled might crash if traceoptions are enabled. PR1289530
When subscriber services that are enabled for interim volume accounting go down, in rare cases the Packet Forwarding Engine process (pfed) might generate a core file with backtrace pfed_timer_manager_c::remove_serv_id. The pfed automatically recovers over the restart and no corrective action is required. PR1296969
When the following example configuration is applied and the archive sites are not reachable, the archiving accounting files might fail and finally the accounting data might be missed: accounting-options { file reStats { files 96; transfer-interval 5; compress; backup-on-failure { master-only; } push-backup-to-master; archive-sites { "<remote-site>"; } }. PR1300764
In a subscriber management environment, the dfwd process might crash during execution of the show firewall templates-in-use command if a CLI session disconnects before the complete output of this command is received. PR1305284
If two archive sites are configured under the [accounting-options file <filename>] configuration hierarchy, the first archive site listed uses the SFTP protocol and is not reachable. The accounting files backup might occur to the second site listed. PR1311749
General Routing
On MX Series routers with multiple MPCs (for example, 10 MPCs), during a unified ISSU, some of the MPCs might go offline permanently because the upgrade process takes more time. PR1005030
Transit LDP packets go to the host path. PR1011598
The following errors are seen in the logs: Err] pfeman_private_msg_enqueue 198 : Allocation failed, No Memory Err] pfeman_private_session_manager 541 : pfeman_private_msg_enqueue failed. This is a memory allocation failure because the Packet Forwarding Engine was running out of memory. Continuous usage of memory pointer values without checking for validity leads to a Packet Forwarding Engine crash. PR1022542
For Junos OS Releases 13.3R5, 14.1R1, and later, the MX Series Virtual Chassis interchassis TCP control flows are changed to Virtual Chassis high priority, so a high volume of VC interchassis TCP control flow might impact Virtual Chassis stability and responsiveness to external protocol events. With the fix, the priority of Virtual Chassis interchassis TCP control flow has been reverted. PR1074760
Processes (or daemons) using a synchronous API can get stuck because these APIs are blocking in nature and do not allow a mib2d or ifinfo to perform any activity during this period. For example, NMS queries on interfaces (for which a mib2d shall respond) could time out if a mib2d is stuck in such a state. PR1078505
During logical interface cleanup rtsock_peer_unconsumed_obj_add:object already deleted log messages might indicate that the search failed without citing incorrect results. PR1085626
Memory leak is seen in the LSP attributes object for "RSVP 16" memory block. When there is an error during creation of the RSVP path state (the PSB data structure), the data structure itself is freed but some associated memory is not freed. This is causes memory leak. It is very unlikely that this error condition ever happens on an NSR master Routing Engine (or when no NSR is configured). But on the NSR backup Routing Engine, there are more likely to be conditions that cause the path state creation to fail, thus exposing the memory leak in the error handling code. PR1115686
Dynamic tunnel interface bounces causes memory corruption, which leads to an rpd crash. The new rpd process synchronizes with the kernel, which might have stored the information about the GRE tunnel logical interface created by the previous rpd process. The new rpd process uses this information from the kernel, leading to subsequent rpd crashes being triggered. The following logs might be seen when the issue occurs: user@host>show log messages| match "Address already in use" %DAEMON-3: Error creating dynamic logical interface from sub-unit 32792: Address already in use %DAEMON-3-RPD_KRT_Q_RETRIES: kqp 0x49df00d0: op add queue low-add attempts 4010 ifd index 284, ifl unit 32792, family 2 instance id 0, state CreateIFL RPD_KRT_Q_RETRIES: IFL IFF Update: Address already in use. PR1152912
The Junos OS supports the setting of interface-mode trunk even though vlan-tagging or flexible-vlan-tagging is not in effect on the local interface. This results in a MTU that is 4 bytes smaller than the one when vlan-tagging is set. The difference in supported MTU can lead to unexpected fragmentation issue, which results in silent discard in a Layer 2 network. PR1154024
When upgrading Junos OS software on RE1, and if at the time, RE1 is the master Routing Engine, both Routing Engines might be in backup state, resulting in losing remote connectivity and all interfaces. Only console access is available at this time. PR1172729
On MX240, MX480, and MX960 platforms, due to resources contention during multiple commit processes, the kernels might display I2C bus errors. PR1174001
Port block efficiency and unique pool users statistics display incorrect values when the NAT pool is modified dynamically with CGNAT traffic for the particular term in the NAT rule. PR1177244
On MX240, MX480, MX960, MX2010, and MX2020, offlining one FPC might lead the fabric chip to have some stale packets corresponding to the destination that went down. As a result, traffic loss might be observed. PR1185901
After loading CoS-related configuration on MPC5E, MPC6E, MPC2E-NG, and MPC3E-NG line cards, error messages might be seen. . PR1186645
On MX240, MX480, MX960, MX2010, and MX2020 platforms, in rare cases, the MPC4 line card might never come back online after rebooting the chassis by using the request system reboot both-routing-engine command. PR1190418
On MX Series routers with NAT service configured on AMS interfaces, after rebooting the FPC or PIC, the NAT pool split between AMS members is incorrect. There are overlapping IP pools and sometimes missing pools, causing NAT to not work correctly. PR1190461
When PIC PB-4OC3-4OC12-SON-SF (4x OC-12-3 SFP) is replaced with PB-4OC3-1OC12-SON2-SFP (4x OC-3 1x OC-12 SFP) and a CLI commit is done, the replacement PIC type bounces. PR1190569
As described in RFC 7130, when LACP is used and considers the member link to be ready to forward traffic, the member link might not be used by the load balancer until all the micro-BFD sessions of the particular member link are in the up state. PR1192161
In an MX Series BNG subscriber management environment, RADIUS accounting statistics provided by the MX Series BNG might slightly deviate from the actual statistics if the subscriber session is terminated abruptly while traffic flow was active. PR1192775
In port mirroring, IPv4 inbound traffic might not get mirrored to the 10G analyzer interface in a certain interface type. PR1194139
Crash with a core file is seen when an IPv6 firewall filter with syslog action is configured and applied on VMX, MPC7, MPX8, and MPD9 cards. PR1195706
On MX Series platforms with MPC5E installed, in a high-temperature situation, the temperature thresholds for triggering the high-temperature alarm and controlling fan speed are based on the FPC level. Any sensor values in the FPC that exceed the temperature threshold of the FPC trigger the actions associated with temperature thresholds. PR1199447
A stale VBF flow entry a left after subscribers were migrated from one port to another, leading to the IP address being subsequently unusable on platforms running a Junos OS enhanced subscriber management release. PR1204369
When PPPoE subscribers log in to or out of the device, an SNMP link up or down trap is generated by the system if no-trap is configured in the corresponding dynamic-profile. PR1204949
In some rare scenarios, the remote VPLS PE router coming up might cause TCP keepalive timeouts on the local sockets between the master Routing Engine and the FPCs (for example, ppmd <-> PPManager connection): kernel: tcp_timer_keep: Dropping socket connection due to keepalive timer expiration Local(0x80000001:6011) Foreign(0x80000015:36678) kernel: tcp_timer_keep: Dropping socket connection due to keepalive timer expiration Local(0x80000001:6011) Foreign(0x80000012:25385) kernel: tcp_timer_keep: Dropping socket connection due to keepalive timer expiration Local(0x80000001:6011) Foreign(0x80000013:5934). The problem is caused by a delay in packet processing on the em0 interface (including the TCP keep alives from FPCs). This problem might also occur if there is any network churn and delay in processing the keepalive for some other sockets. The keepalives of other sockets are randomly dropped, resulting in protocol flaps. PR1209308
BGP PIC installs multiple MPLS LSP next hops as active instead of standby in the Packet Forwarding Engine. This might cause a routing loop. PR1209907
On MX Series routers with MPC3, MPC4, MPC5, MPC6, MPC2-NG, and MPC3-NG line cards, the chassisd process crashes continuously on both Routing Engines because some failure cases caused by underlying software and hardware are not handled gracefully. Both Routing Engines might loose mastership and get stuck in backup mode. PR1213808
On M Series, MX Series, and T Series routers, enabling the VRRP delegate-processing ae-irb feature might cause VRRP and BFD to flap. PR1219882
PPPoE or DHCP subscribers fail to bind due to ProcessPADIFailedUiflNotActive/SML_CLIENT_DELETE_SDB_ADD_FAILED errors after continuous login and logout, and subsequent login fails. PR1221690
On rare occasions, offlining a MIC-3D-16CHE1-T1-CE MIC might cause an FPC core file. This is unlikely to occur in general and chances of it happening are very low. PR1223277
Multiple vulnerabilities in stunnel software included with Junos OS have been resolved by upgrading stunnel to 5.38. Refer to https://kb.juniper.net/JSA10852 for more information. PR1226804
Flowstat reply has incorrect DL type. PR1228383
The following log is not an error and also does not indicate any functionality break or impact cc_mic_irq_status: CC_MIC(5/2) irq_status(0x1d) does not match irq_mask(0x20), enable(0x20), latch(0x1d). The message is seen continuously for MIC-3D-4OC3OC12-1OC48. PR1231084
On MX Series platforms with MPC2E-3D-NG, MPC2E-3D-NG-Q, MPC3E-3D-NG, and MPC3E-3D-NG-Q line cards, if the FPC-MIC link failure occurs, the bridge might keep sending register messages in an infinite loop causing continuous PCI exception. The MPC might crash and traffic forwarding might be affected. This is a rare issue and it is difficult to reproduce. PR1231167
An MX Series router running Junos OS Release 14.1R9 might display the error message _FPC: Error requesting SET BOOLEAN, illegal setting 39 [CM_BOOLEAN_ROUTE_MEMORY_ENHANCED]. PR1232626
When there is an MS-MPC card installed in an MX Series router, the MS-MPC card might crash when OSPFv3 IPv6 traffic goes through it, impacting all the services running on the MS-MPC card. PR1233459
FPCs on the MX960 platform might be stuck in offline state with FPC Incompatible with SCB due to a delayed PEM startup. PR1235132
When non-Juniper Networks SFP is used in an MIC-3D-20GE-SFP-E or MIC-3D-20GE-SFP-EH MIC, the ISR 2 (MIC error interrupt) might be running off over 2.5 seconds due to an unknown reason, and then the FPC hosting the MIC might restart and crash. PR1235475
In a race condition, ksyncd crash might be seen on the new master Routing Engine after performing unified ISSU or GRES switchover. This issue is difficult to reproduce. PR1241875
After detaching the last traffic-bearing physical interface stream, the cleanup is not proper and it might result in issues. PR1243547
Currently MS-MIC supports a maximum of 2 million routes scale. This includes all IPv4, IPv6, and MPLS routes in the system. When scale limit is exceeded, the forwarding database (FDB) memory will be exhausted and the MS-MIC will start to drop the routes and also print logs. PR1243581
MX Series with MPC3E, MPC4E, MPC5E, MPC6E, MPC2E-NG, and MPC3E-NG linecards might drop traffic under high temperature (67 degrees Celsius or higher). PR1244375
SPMB reboot causes fabric traffic to be silently dropped or discarded for more than 1 minute in T Series. PR1248063
Accounting statistics are not correctly preserved across unified ISSU upgrades. PR1250919
If the Media Access Control Security (MACsec) session flaps, dot1x might crash and generate a core file, and then the MACsec session might fail to be established. PR1251508
Malicious LLDP crafted packet leads to privilege escalation and denial of service (CVE-2018-0007). Refer to https://kb.juniper.net/JSA10830 for more information. PR1252823
The Ethernet OAM Link-Fault Management (EOAM LFM) adjacency on XM-based MPC might flap when the unrelated MIC that is in the same MPC slot is brought online and a short OAM interval is configured (such as OAM pdu-interval 100 ms and pdu-threshold 3). Note: XM-based MPCs include MPC2E-NG, MPC3E-NG, MPC3E, and MPC6E. PR1253102
On MX Series routers with MPC2E-NG and MPC3E-NG, the interfaces of these line cards might not come up when connecting to a third-party transport switch. PR1254795
IRBs that are part of an L3 multicast group allocate ASIC memory when added to the group. A small amount of this memory is not freed when changes are made to the L3 multicast group. This could cause a crash due to an out-of-memory condition if there are continuous changes to multicast groups with IRBs over a long period of time. PR1255290
A random heap block corruption is caused when flow is added to pseudo logical interface when flow's associated logical interface is non-vbf logical interface. When this condition is met, FPC crashes and generates a core file. PR1256065
The transmit delay interval is the maximum time the key server waits before installing a new TX SAK (default value is 6 seconds). When MKA transmit interval is set to 6 seconds, during key rollover both transmit interval and delay interval timers expire at the same time and a new TX SAK gets installed on the key server before the RX SAK is installed on the peer node, causing traffic drop. PR1257041
Unable to run show subscribers extensive and some other CLI commands after GRES because subscriber-management database is unavailable. The other symptoms of the bug can be similar to messages like sdb.db: close: Bad file descriptor after commit full. PR1258238
In a subscriber service environment, the device control process (dcd) might restart unexpectedly during commit process after changes to ATM interface configuration. PR1258744
In case that license keys are activated in the system through the configuration, which would mean that under the system license keys configuration stanza, certain events or changes can make them noneffective. Those events or changes include Routing Engine mastership switchover or group-related configuration changes. PR1259460
Class of service (CoS) does not correctly classify egress L3 multicast traffic from an ingress VLAN bridge interface after a configuration change. PR1260413
On MIC-3D-20GE-SFP-E or MIC-3D-20GE-SFP, when SFP diagnostic information is being read out periodically, due to a malfunctioning SFP or noise on the I2C BUS, the SFP thread might hog CPU resources, and a watchdog check will restart the MPC to recover. Enhancements prevent the SFP thread hogging and MPC restart. PR1260517
In PPPoE subscriber management environment, when the subscribers logout, many link control protocol (LCP) term request and PPPoE Active Discovery Termination (PADT) messages do not get a response from MX Series for a lot of sessions. This might impact service. PR1260626
The first multicast IP packet is saved while waiting for a route to be resolved. PR1260729
When a dynamic profile version update is followed by GRES immediately, without even a single subscriber attached in between, new subscribers might not be allowed to be attached. In this case, the jpppd daemon, which does not have the updated profile database, might cause this issue. PR1260836
In an MX Series BNG subscriber management environment, there could be a slight deviation in the service accounting statistics when the subscriber session terminates abruptly. PR1260898
On MX Series routers, in a rare case the backup Routing Engine is slow to process replication. Replication on the master Routing Engine continues too long under a purge condition and results in logic problems and smgd crash on the backup Routing Engine. PR1261268
During multicast activation of dynamic subscribers through a service profile, the bbe-smgd daemon in the backup Routing Engine might crash. PR1261285
On MX Series routers with QSFP optics, receive-loss cleared and set messages will repeat when the laser is down, even when actual flapping does not occur, and overwhelm the messages file. PR1261793
In a subscriber management scenario, it is observed that an authenticated dynamic VLAN interface with an idle-timeout is removed if there are no subscribers on top and if remove-when-no-subscribers is configured at the auto-configure stanza. The dynamic VLAN interface is removed when the idle timeout expires if the interface stayed idle during this period. PR1262157
In a BNG subscriber with authentication based on RADIUS[26-1] attribute or domain-map scenario, if one subscriber is authenticated and then relocated to a corresponding routing instance based on RADIUS[26-1] attribute or domain-map, the ICMP network unreachable message might not get sent back to the subscriber client. PR1263094
The dynamic VLAN interface is logged out upon reaching idle-timeout even though there is a client session (PPPoE or DHCP) above it. The proper behavior is to keep the dynamic VLAN interface in case a client session (PPPoE or DHCP) is present above the dynamic VLAN interface. PR1263131
With subscribers connected, when you run the show arp command and afterward execute other CLI commands, a delay in the display of output is seen. PR1264038
The peer side of the TCP session of BGP is violating the window given by Junos OS and sends more data because of NSR day-one issues. That is, the backup TCP gets stuck and finally holdtime expires after GRES instead of dropping the packets. PR1264436
In a scaled number of VRF instances scenario with vrf-table-label configured, the rpd might crash after deleting some VRF instances. PR1264464
The subscribers are unable to connect at the high number of configured dynamic profiles (180-200). PR1264629
Because of transient hardware error conditions, only syslog events XMCHIP(x) FI: Cell underflow at the state stage - Stream 0, Count 65535 are reported, which is a sign of a fabric stream wedge. Additional traffic flow register pointers are validated and if stalled a new CMERROR alarm is raised: XMCHIP(x) FI: Cell underflow errors with reorder engine pointers stalled - Stream 0, late_cell_value 65535, max_rdr_ptr 0x6a9, reorder_ptr 0x2ae. PR1264656
On MX Series routers with MS-MPC, with Ethernet frames with more than 2000 bytes of payload, the mspmand process that manages the multiservices PIC might crash. Traffic forwarding might be affected. PR1264712
In some situations, MX Series LAC does not encapsulate packets received from CPE in l2tp tunnel if this subscriber has a static pp0 unit configured on the LAC side. This issue is causing a permanent traffic black hole condition (in which traffic is silently dropped) for this subscriber and leads to PPP session flaps or inability to establish a PPP session between CPE and LNS when using lcp renegotiation on the LNS side. PR1265414
PCC-controlled LSP metric is not getting updated on the controller, and PCE-delegated LSPs do not come up. PR1265864
If the dynamic VLAN profile does not have an interface family (IFF) configuration (for example, family PPPoE or family inet), but has a firewall filter configuration, firewall filter indixes will not be released after the dynamic VLAN is removed. This eventually leads to the depletion of available firewall filter indexes. PR1265973
According to IETF RFCs, IGMPv3 & MLDv2 reports are not sent to IANA reserved multicast addresses 224.0.0.22(IGMP V3 ROUTERS) and ff02::16(MLD V2 ROUTERS), respectively and should be discarded. But BNG processes these reports. With this fix, the reports are discarded and the Rx error counter is updated. PR1266309
In a rare condition, the kernel running in the Routing Engine might keep rejecting connection from the FPC due to the inconsistent connection state between the Routing Engine and the Packet Forwarding Engine. PR1266379
When VSTP is enabled on a double-tagged aggregated Ethernet logical interface and there is another single-tagged aggregated Ethernet logical interface configured with the same router VLAN tag, then the incoming traffic on that VLAN incorrectly hits the AE_RESERVED_IFL_UNIT (AEx.32767) and the traffic gets dropped. PR1267238
The bbe-smgd process might crash and generate a core file under certain boundary conditions on the standby Routing Engine with certain specific configurations. Because the core is on the standby, no disruption in service is expected, and the system recovers from this condition. PR1267646
On MX Series routers, while configuring dynamic VLANs for subscriber access networks and DVLANs are authenticated, if the bbe-smgd process is restarted during high subscriber churn, all subscribers might have difficulties connecting to the BNG or might not be able to log in at all. PR1267704
The CLI command set chassis effective-shaping-rate is enabled for the MX104. PR1267829
In an OpenFlow scenario, an OpenFlow filter is automatically created for each OpenFlow logical interface. In a rare race condition, when an OpenFlow filter is deleted and queried at the same time (for example, you delete an OpenFlow logical interface on one terminal while executing show openflow filters on another terminal), the openflowd process might get stuck in a loop, which might lead to 100 percent CPU usage. The OpenFlow filter query commands are as follows: show openflow filters, show openflow filters interface, and show openflow filters switch . PR1268527
A low-memory condition puts the service PIC into the red zone on the MS-MIC or MS-MPC card when the SIP ALG is used. This might cause the SIP ALG to generate a core file. PR1268891
On MIC-3D-20GE-SFP-E and MIC-3D-20GE-SFP-EH, an interrupt threshold is introduced. If MIC error interrupts are more than the threshold (greater than 2500 per 5 minutes), then the MIC or FPC is restarted. As a result, an MIC error interrupts and overloads the CPU when restart is initiated. PR1270420
The Routing Engine might stop all services after GRES or unified ISSU. This issue is caused by corrupted Berkeley DB file after GRES or ISSU. PR1271306
Changing the mode of the interfaces causes the interface to go down or up. For the interface to be down, all the queues (in/out) associated need to be emptied. Due to a certain condition, the queue does not get emptied and the interface pointer does not get freed properly, resulting in an FPC crash. PR1273462
The mspmand log incorrectly generates messages about memory zone level. This occurs every 49.7 days and will recover by itself. This is a display issue and will not affect traffic. PR1273901
In a subscriber management scenario, due to unavailability of the subscriber-management database (SDB), many CLI commands related to subscribers such as show subscribers detail, show subscribers extensive, and so on might not work. PR1274464
On MX Series with MPC7E-10G, MPC7E-MRATE, MX2K-MPC8E, or MX2K-MPC9E line cards with continuous fabric re-order events might drop all packets of that fabric stream. Subsequently, the re-order engine might be stalled and might not recover anymore. PR1276301
On an MX Series platform with MS-MPC or MS-MIC installed, a Security Policy Database (SPD) memory leak might be observed after adding or removing the service-set statement from the configuration. The Security Policy Database (SPD) eventually crashes due to memory exhaustion. PR1276809
IS-IS adjacency does not come up over the lsp interface between ACX Series and MX Series platform. However, adjacency works fine on ACX Series to ACX Series, MX Series to MX Series, and ACX Series to M Series (MS-DPC). PR1278377
When different routing instances (with "multipath" configured) learned the BGP same-prefix routes, and exported them from one instance to another through the auto-export command, the rpd process might get stuck. PR1279260
The packets with unexpected tag-protocol-id (TPID) for aggregated Ethernet interfaces were not computed correctly. Also, the packets with TPIDs that are considered invalid were not dropped, but instead were getting stuck in a recursive processing loop that leads to the core. PR1279402
On MX Series routers in a subscriber scenario, if class of service (CoS) is applied to the subscriber, when issuing some changes to an aggregated Ethernet (AE) bundle, CoS might not work as expected. PR1279788
After a MS-MPC or MS-PIC goes offline or online or gets bounced (because of an AMS configuration change), sometimes the PIC might take approximately 400 seconds to come up. PR1280336
In a subscriber management environment, if the authenticated subscriber dynamic VLAN receives idle timeout from the RADIUS server, due to a rare timing issue, the dynamic VLAN interface might be removed immediately after it was successfully created. PR1280990
The service-accounting-deferred for the L2BSA subscriber ingress firewall filter does not include non-IP traffic statistics. PR1281201
In a subscriber management environment, some subscribers might not be able to connect to the MX Series broadband network gateway (BNG) and might get stuck in Init state if the autoconf process fails to access the Session DataBase (SDB) during their login attempts. If the problem is observed, all consequent login attempts for the affected subscribers will fail. PR1281896
The issue was seen during aggregated Ethernet configuration activation or deactivation. Junos OS ended up in a transient situation where the aggregated Ethernet interface has no child to inline-ka but was attempting to clear the inline-ka unilist selector. Later on during the ageout, inline-ka delete again tried to clear the same selector this resulted in an FPC crash. PR1282022
A routine within an internal Junos OS sockets library is vulnerable to a buffer overflow. Malicious exploitation of this issue might lead to a denial of service (kernel panic) or be leveraged as a privilege escalation through local code execution. The routines are only accessible through programs running on the device itself, and
veriexec
restricts arbitrary programs from running on Junos OS. There are no known exploit vectors utilizing signed binaries shipped with Junos OS itself. Refer to https://kb.juniper.net/JSA10792 for more information. PR1282562In a rare corner case, the kernel might crash and a core file might be generated. PR1282573
Unrelated configuration changes related to a routing instance result in invalid or incomplete inline J-Flow data packets. PR1282580
VBF flows are not programed correctly on aggregated Ethernet interfaces, resulting in 50 percent traffic loss. PR1282999
GRE Operation, Administration, and Maintenance (OAM) fails to come up when the GRE tunnel source and the family inet address are the same (as shown in the following configuration statements): set interfaces ge-0/0/0 unit 0 family inet address a.b.c.d/30 set interfaces gr-0/0/1 unit 0 tunnel source a.b.c.d set interfaces gr-0/0/1 unit 0 tunnel destination x.x.x.x set interfaces gr-0/0/1 unit 0 family inet unnumbered-address ge-0/0/0.0 set protocols oam gre-tunnel interface gr-0/0/0.0 keepalive-time x set protocols oam gre-tunnel interface gr-0/0/0.0 hold-time x. PR1283646
In Junos OS, bbe-smgd process denial of service is observed while processing VLAN authentication requests or rejects (CVE-2018-0006). Refer to https://kb.juniper.net/JSA10834 for more information. PR1284213
When the service set has both a NAT rule and a stateful firewall rule configured and a source IP address could not be matched with any NAT rule but could be matched with a stateful firewall rule, the PPTP session from this source IP address might not be successfully established. PR1285207
On the MX104, LC, PFH, or Packet Forwarding Engine interfaces do not come up on RE1 if the router is booted with single Routing Engine on slot1. PR1285606
This issue affects single Packet Forwarding Engine MX Series routers such as MX80 and MX104 and affects all types of DDoS packets. In affected releases, you will not see message logs DDOS_SCFD_FLOW_FOUND pop when there is a culprit flow is found. In affected releases, you will not see proper output from show ddos-protection protocols xxx (culprit-flows) output. PR1286521
After the first GRES, BBE_SMD_MSG_GET_PSEUDO_IFL_FAIL error is displayed on the new backup. This error might cause some routes on the backup Routing Engine to be created with a null next hop. PR1286849
SNMP query for IF-MIB::ifOutQLen reports Wrong Type should be Gauge32 or Unsigned32 for a dynamic VLAN demux0 interface. PR1287852
The bbe-smgd process could crash when upgrading software by using the request system software install <image-name> no-validate reboot command on both RE0 and RE1 with active PPPoE subscribers. PR1288121
The routing protocol process (rpd) might not immediately notify the kernel to reinstate the direct routes associated with an interface coming up. PR1288492
Performance issues are seen when nontranslated traffic is introduced to a serviceset using a large number of NAT terms. When 2000 NAT terms were present and a few 100 pps did not match any of these NAT terms and also went through nontranslated, this performance issue is seen. PR1288510
After GRES, the smid process thrashes and does not restart after the fatal SDB error. PR1288871
In MX Series routers with Virtual Chassis mode, if the configuration statement heartbeat is enabled, kernel "rtdata" memory might leak and rtdata memory usage might reach a high rate (for example, more than 2 GB). This might affect the performance of the device. PR1289363
When the show hsl2 statistics detail command is executed, continuous error logs are seen on next-generation MPC (MPC2E NG, MPC3E NG) in Junos OS Release 15.1Rx. These error logs can be reported for PFE0_XF and fabric channel. If one of the next-generation MPCs is in this state, the CB plane fails because of HSL2 train failure. PR1290645
With IKEv1 aggressive mode, dead peer detection and network address translation traversal might not work because there is no vendor-ID shared. PR1290689
When IGMP protocol is enabled, there might be a leak of 56 bytes in the bbe-smgd process (daemon) during the logout of every subscriber who had joined any multicast group during the session. PR1290918
The kernel might not install the route when static route or static LSP next hop address is the same as address on outgoing interface. PR1291917
When a subscriber using a filter logs out, the filter resource will be freed. But because of the timing issue, the filter index might be freed in the Routing Engine but not in the Packet Forwarding Engine, causing an orphan filter condition. When the subsequent subscriber using the filter tries to log in and if Routing Engine tries to add a new filter with the same index, the Packet Forwarding Engine rejects it. This causes login failure with an error vbf_filter_add_orphan_check. But the subsequent login attempt after this failure might work because the problem index will be removed by the Routing Engine. PR1292582
In a subscriber management environment an error message (fpc[x] jnh_if_vbf_comp_ifl_list_update_queue(x): ifl .pp.[x] (x): donor x pfe [0] Bad jnh instruction x) is triggered while bringing up the subscriber. In case the TCP profile is attached to subscriber’s logical interfaces. This error message might flood for 2 minutes. PR1293057
Junos OS releases with a fix committed in Junos OS Releases 15.1R5-S4, 16.1R4-S3, 16.1R5, and 17.3R1 with XM-based line cards (MPC3E, MPC4E, MPC5E, MPC6E, MPC2E-NG, and MPC3E-NG) might report a DDR3 TEMP ALARM chassisd error log message. PR1293543
CPCD process generates a core file using Routing Engine-based http-redirect. PR1293553
The show extensible-subscriber-services sessions command is reporting the timestamp increased by 1 hour after a unified ISSU. Timestamps might be the same after the unified ISSU. Before the unified ISSU starts, the timestamp might be show extensible-subscriber-services sessions | match Time Timestamp: Wed Jul 12 10:04:57 2017 Timestamp: Wed Jul 12 10:04:57 2017 Timestamp: Wed Jul 12 10:04:57 2017 Timestamp: Wed Jul 12 10:04:57 2017 Timestamp: Wed Jul 12 10:04:57 2017 After ISSU completed: show extensible-subscriber-services sessions | match Time Timestamp: Wed Jul 12 11:04:57 2017 Timestamp: Wed Jul 12 11:04:57 2017 Timestamp: Wed Jul 12 11:04:57 2017 Timestamp: Wed Jul 12 11:04:57 2017 Timestamp: Wed Jul 12 11:04:57 2017. PR1293800
Loss of DHCP or PPPoE subscribers occurs during unified ISSU from Junos OS Release 16.1-20170718_161_r4_s5.0 to Junos OS Release 16.1-20170718_161_r4_s5.0. PR1294709
During PPPoE subscriber login errors like [ vbf_flow_src_lookup_enabled] and failed to find iff structure,ifl ] were seen on the FPC. PR1294710
On MX Series routers in a dual-stack subscriber scenario, if the DHCP dual-stack subscriber's CoS is in both the client profile and the service profile, when the second family DHCP client logs in, the CoS of the service profile might be overridden by the CoS of the client profile. PR1296002
In case of reaching the fire alarm threshold, the chassis might trigger shutdown with an incorrect high temperature timer log message that does not indicate the fire condition. In case of a fire condition, chassis shutdown wait time is 5 seconds. However, in case of high temperature, it is 240 seconds. PR1298414
In a subscriber management environment, the bbe-smgd process might crash if the traceoptions are enabled because of an invalid username that contains a format specifier (for example, the character "%" ) that cannot be successfully handled by the traceoption process. PR1298667
The I2C bus cannot withstand noise. On providing support for enhanced AC NON-HC PEM, a susceptible to noise, which suceptible software enhancements are made to suppress the I2C bus errors. PR1299284
At the time of ESSM subscribers' login and logout, flat accounting files are generated out of the configured period in MX960 BNG running Junos OS Release 16.1R4-S5: user@router> file list detail /var/log/rpm-5* Aug 18 11:20:17 -rw-rw-rw- 1 root wheel 69 Aug 18 11:20 /var/log/rpm-5-minutes -rw-rw-rw- 1 root wheel 914 Aug 18 11:19 /var/log/rpm-5-minutes.0 -rw-rw-rw- 1 root wheel 914 Aug 18 11:14 /var/log/rpm-5-minutes.1 -rw-rw-rw- 1 root wheel 914 Aug 18 11:09 /var/log/rpm-5-minutes.2 -rw-rw-rw- 1 root wheel 914 Aug 18 11:04 /var/log/rpm-5-minutes.3 -rw-rw-rw- 1 root wheel 1084 Aug 18 10:59 /var/log/rpm-5-minutes.4 -rw-rw-rw- 1 root wheel 1183 Aug 18 10:57 /var/log/rpm-5-minutes.5 ---> this file created out of order -rw-rw-rw- 1 root wheel 1182 Aug 18 10:54 /var/log/rpm-5-minutes.6 -rw-rw-rw- 1 root wheel 1182 Aug 18 10:49 /var/log/rpm-5-minutes.7 -rw-rw-rw- 1 root wheel 913 Aug 18 10:44 /var/log/rpm-5-minutes.8 -rw-rw-rw- 1 root wheel 113 Aug 18 10:41 /var/log/rpm-5-minutes.9 ---> this file created out of order -rw-rw-rw- 1 root wheel 113 Aug 18 10:39 /var/log/rpm-5-minutes.10. PR1299597
After GRES, the subscriber database might get stuck because it is not ready. The following CLI output is seen: user@router> show subscribers Database status: The database is not yet ready.Please try after some time. PR1299940
If nonstop active routing (NSR) is enabled, BGP uses Rsync (a TCP-based protocol for synchronizing files) to synchronize data between the rpd on the master Routing Engine and the backup Routing Engine. When some routing-instance specific configurations (such as auto-RD or route targets) are committed and a BGP Rsync error (such as a transport error causing the BGP Rsync connection to go down) occurs at the same time, a timing issue might lead to an rpd crash. PR1301986
The default interrupt threshold might cause unwanted MIC reset when interfaces on an enhanced MIC flap continuously. The fix of this PR introduces a hidden CLI configuration: set chassis fpc <> pic <> interrupt-threshold <> (pic-slot takes only 0 or 2 as valid PIC slots). It provides flexibility to the user to make the interrupt threshold configurable to avoid a false positive (unwanted MIC reset). PR1302246
With protocol-independent load balancing for Layer 3 VPNs enabled (that is, configure routing-instances <routing instance name> routing-options multipath) in a virtual routing and forwarding (VRF) routing instance, when toggling a TTL action statement (that is, vrf-propagate-ttl and no-vrf-propagate-ttl) for this VRF routing instance, if BGP receives a VPN route update for the VRF during the processing of the reconfiguration, the rpd might crash. This is a timing issue due to the race condition. PR1302504
Slow chassisd memory leak might occur because of the SNMP polling of entAliasMappingTable (1.3.6.1.2.1.47.1.3.2). During polling of entAliasMappingTable, the memory might not be freed, thus leading to the leak. PR1303061
In the subscriber management scenario with point-to-point protocol (PPP) enabled, the PPP interfaces might use the unreasonable default MTU (1500) on interfaces in some situations such as, when the PPP LCP packet containing the MTU sent from the device is rejected by the clients and a PPP MTU is not defined in the dynamic profile. PR1303175
On MX Series platforms, in a PPPoE over aggregated Ethernet interface scenario, after rebooting the aggregated Ethernet member leg FPC, the point-to-point protocol (PPP) keepalive echo requests might stop being generated on aggregated Ethernet interfaces. PR1303249
On routers with XM-chip-based line cards (for example, MX Series routers with MPC3E, MPC4E, MPC5E, MPC6E, MPC2E-NG, and MPC3E-NG), log messages might report fan speed changes between full and normal speed continuously, because XM-chip reaches a temperature threshold. PR1303459
The following kernel messages are seen: GENCFG: op for <type> failed; err <id> <error-string> For example, %KERN-1-GENCFG: op 15 (Firewall) failed; err 1 (Unknown) were incorrectly classified as alert message (Severity 1). Those are debug messages, and can be safely ignored. This PR reclassifies those messages as Debug (Severity 7). PR1303637
In some rare cases, if the filter aci statement is present in the configuration the output of the show pppoe lockout CLI command might get truncated as shown in the following example: user@router> show pppoe lockout xe-0/0/0.1100 Index 368 Short Cycle Protection: circuit-id, Lockout Time (sec): Min: 1, Max: 300 Total clients in lockout: 0 Total clients in lockout grace period: 13 Lockout Time (sec): Min: 1, Max: 300 Total clients in lockout: 0 Total clients in lockout grace period: 89 Lockout Time (sec): Min: 1, Max: 300 Total clients in lockout: 0 Total clients in lockout grace period: 35 Lockout Time (sec): Min: 1, Max: 300 Total clients in lockout: 0 Total clients in lockout grace period: 1 Lockout Time (sec): Min: 1, Max: 300 Total clients in lockout: 1 Total clients in lockout grace period: 25 Client Address Current Elapsed Next IXIA#1#05#40:0.35 300 228 300 00:07:72:00:A1:42 Lockout Time (sec): Min: 1, Max: 300 Total clients in lockout: 0 Total clients in lockout grace period: 0 Lockout Time (sec): Min: 1, Max: 300 Total clients in lockout: 0 Total clients in lockout grace period: 1 Lockout Time (sec): Min: 1, Max: 300 Total clients in lockout: 0 Total clients in lockout grace period: 5. PR1304016
As a result of regression, introduced in Junos OS Releases 14.1R5, 14.2R3, 15.1R1, 15.1F2, and later releases, G.751-framed E3 interface traffic rate has been limited to 30 Mbps on certain MX Series MICs. This PR is to restore the correct E3 rate. PR1304344
RPF check strict mode causes traffic drop in the next-generation subscriber management release. This issue is triggered because source lookup fails. PR1304696
Commit fails with the error ffp_intf_ifd_hier_tagging_config_verify: Modified physical interface "si-1/1/0" is in use by BBE subscriber, active L2TP LNS client. Commit failure is specific to having an implicit hierarchy defined on the SI interface. PR1304951
MX Series routers send immediate interim accounting updates for the services pushed by SRC or RADIUS. PR1305425
With set system internet-options no-tcp-reset drop-all-tcp and NSR configured, you might see the messages kernel: %KERN-5: tcp_timer_keep: Dropping socket connection due to keepalive timer expiration repeatedly on the backup Routing Engine. There is no service impact from the condition that causes the message. PR1305729
On MX Series Virtual Chassis (MX-VC) setup or dual Routing Engine platforms, with scale-subscriber, license loss might be observed during Routing Engine switchover. PR1308620
On MX Series routers in the subscriber scenario, when CoS is applied to subscriber demux logical interfaces (IFLs), it might not work as expected. PR1308671
In the subscriber management scenario, a profile-add-request for a dynamic VLAN might fail, causing subsequent subscriber login for the same VLAN to fail. This is because of issues with internal data structure cleanup following the failed profile-adds. PR1309770
9000 out of 10,000 terminated subscribers go down during the unified ISSU from Junos OS Release 16.1 to Junos OS Release 17.3. PR1309983
Starting with Junos OS Release 15.1R1 with subscriber management configured (next-generation subscriber management release), the bbe-smgd process might report a memory leak after deleting and adding the address pool. It impacts the new subscriber login. PR1310038
In the subscriber management scenario with CGNAT configured, if the device is accessed by millions of sessions that both do not match any CGNAT rule and later are put in the dropflow, the MS-MIC or MS-MPC memory utilization might stay at a high level (RED zone) because of overloaded dropflow. This might also cause disruption of traffic flow. PR1310064
In the dynamic profile, when variable $junos-ipv6-address is used under family inet6 address, a /128 local interface is created, but it is not removed when the subscriber session terminates. When the subscriber is up, the assigned ndra prefix is added along with the local address 2a02:ed0:6000:b78::1/128 intf: 2a02:ed0:6000:b78::/64 user 0 ucst 61920 974 si-0/1/0.2147483650 2a02:ed0:6000:b78::1/128 intf 0 2a02:ed0:6000:b78::1 locl 52255 Logical interface si-0/0/0.2147483649 (Index 432) (SNMP ifIndex 755) ...... Addresses, Flags: Is-Primary Local: 2a02:ed0:6000:1::1 Addresses Local: 2a02:ed0:6000:a::1 Addresses Local: 2a02:ed0:6000:13::1 Addresses Local: 2a02:ed0:6000:19::1. PR1310752
On MX Series platforms, the counter at the PPPoE session logical interface gets incremented when a malformed PPPoE packet is received. PR1312998
On all MX Series platforms, if the PPPoE subscribers run on L2TP access concentrator (LAC) over dual-tagged VLAN and auto-sensed VLANs, all the packets that are being sent to the L2TP network server (LNS) might be dropped, because the LAC Ethernet pads the PPPoE packets with larger size. PR1315009
In the subscriber management scenario with PPPoE configured, bbe-smgd might crash when performing GRES during PPPoE subscribers login. This is a timing issue and only a part of the subscribers might get synchronized to the standby Routing Engine in this case. PR1318528
In the subscriber management environment, the bbe-smgd process might crash multiple times and fail to recover. PR1318887
In rare conditions, MS-MPC or MS-MIC might crash because of too many rekey packets after a new IPsec VPN tunnel is added. All the tunnels on that PIC might be brought down and traffic might be lost. PR1318932
At the completion of MX Series Virtual Chassis unified ISSU, the Virtual Chassis backup member chassis connection to the Virtual Chassis master SNMP daemon is impaired and does not reconnect properly. Performing a local Routing Engine mastership switch on the Virtual Chassis backup chassis corrects the SNMP connection and restores access to the Virtual Chassis backup Chassis MIB objects. PR1320370
An FPC degraded fabric condition detected is reported and FPC might be rebooted when fpc-offline-on-blackholing is configured. The trigger in the FPC has only one Packet Forwarding Engine on this slot, but the FPC, which has two Packet Forwarding Engines, was installed on this slot earlier. PR1320774
For digital subscriber line (DSL) subscribers such as PPPoE, when a customer premises equipment (CPE) device is administratively powered off, the BRAS terminates the subscriber as expected upon the expiry of configured PPP link control protocol (LCP) keepalive value. However, in a scaled scenario, a few subscriber sessions remain active even after the keepalive has expired. As a result, the same CPE (client) cannot reconnect unless the former sessions are cleared or deleted from the server or the client waits for an extended amount of time to make sure the server internally clears those sessions. PR1320880
In the subscriber management environment, MX Series routers might respond to DHCPv6 solicit and router solicitation requests before completing the PPP IPv6CP negotiations with the CPE. PR1321064
On MX Series routers and in scaled number of PPPoE dual stack subscriber scenario, the bbe-smgd process generates a core file after massive clients logout and login. PR1321468
After multiple iterations of MS-MIC going offline or online, the MIC interface logical interfaces remain down dbecause the Routing Engine fails to control PIC communication over the Packet Forwarding Engine. PR1322854
Starting in Junos OS Release 15.1R1 with enhanced subscriber management, snmp interface filters might not work for subscriber interfaces when "interface-mib" is part of the subscriber dynamic profile. Without "interface-mib" in the subscriber dynamic profile, there is no change in behavior. PR1324573
When some specific MPC cards (MPC3E, MPC4E, MPC5E, MPC6E, MPC2E-NG, and MPC3E-NG) work under high temperature (around 67 degree Celsius or higher), XM-DDR3 memory refresh interval is reduced and hence DDR bandwidth and Packet Forwarding Engine (PFE) forwarding capacity are reduced. As a result, traffic might get dropped. PR1325271
In a DHCP subscriber environment for MX Series routers with Apache Tomcat (the next-generation subscriber management) enabled for BNG, when smg-service is restarted or GRES is performed, the VLAN demux interface does not respond to the ARP request. PR1326450
In MX Series BNG, the CoS service object is not deleted properly for TCP and schedular. PR1326853
Host-outbound traffic is not rewritten ieee-801.pbits for dynamic subscriber logical interface over PS interface. PR1329555
The show services nat mappings address-pooling-paired command times out and fails. This occurs because MS service MIC takes more than 60 seconds to process Routing Engine command to timeout. PR1330207
The updated routes are sent by the routing protocol process (rpd) to SRRD with the same timestamp and these routes are sent by SRRD to it's clients. Inline J-Flow uses the timestamp field for identifying if it is an actual update; because the timestamp in the route is not changed, the route updates are ignored. PR1332666
On MX Series platforms with a PPPoE subscriber environment, in order to increase the overall system performance of subscriber access, after optimizing the session database (SDB) using short-term storage cache, the subinfo process might crash and cause the SDB of the MX Series router subscriber to experience a down event. As a result, the PPPOE subscribers might get disconnected from the MX Series router. PR1333265
The UID limit is reached in a large-scale subscriber scenario when dynamic profiles use UID variables, or CoS is configured in Junos OS Release 15.1 or later releases. PR1334886
The service creation fail in bbe_cos_iflset_conf_add and bbesmgd process might generate a core file. PR1336852
In some scenarios, when the discard interface configured with IGMPv3 joins as an existing multicast flow, because of the change in the outgoing interface list (OIL), the KRT queue gets stuck while multicast next hop (MCNH) is reprogrammed. PR1342032
The rpd and kernel go out of synchronization because of add-delete-add cases of multicast next-hop (MCNH), and the discard (dsc) interface is used as a part of MCNH. An rpd crash is seen on the master Routing Engine, and a KRT error along with a KRT retry message are observed. PR1342343
On MX Series routers with 100M SFP used on MIC-3D-20GE-SFP-E/MIC-3D-20GE-SFP-EH, SFP might not work if it is third party. PR1344208
High Availability (HA) and Resiliency
The rtsock message length that was sent by the ksyncd process to the kernel through rtsock was incorrectly set to IPC length. PR1052425
With GRES enabled and set system syslog file messages daemon any configured, a log message regarding ksyncd might be generated on the backup Routing Engine. PR1203163
The vmcore files are generated due to a GRES issue, which is caused by the VCP port flapping events. VCP port flapping leads to the communication drop between VCMM and VCBM, and then it caused a ksyncd initialization error occurs when ksyncd tries to cleanup stale states. The current retry counts of clean up are not enough to wait for the cleaning event to end, so the ksyncd-triggered vmcore generates a core file; however, the cleanup is finished in time. At the end, no ksyncd core file might be triggered. For fixing this kind of issue, Junos OS enlarges the retry count of ksyncd to provide more tolerance before generating the core file. PR1274438
Infrastructure
On MX Series platforms, the harmless log of invalid SMART checksum might be seen when performing software upgrade to specific releases (for example, Junos OS Release 15.1F5-S3, 15.1F6-S1, 15.1F7, 15.1R4-S3, 15.1R5, 16.1R1, 16.1R2, and Junos Os Release 16.2R1). PR1222105
If SSD contains a valid permanent (non-resettable) offline-uncorrectable-sectors positive value, smartd logs on the nonzero value by default for every 30 minutes, which is too frequent logging, considering that there has not been a change in the value. PR1233992
The show system users CLI output displays users who are not using the router. The request system logout CLI command cannot clear the stale telnet sessions. This is a cosmetic issue, because the command show system connection and the CLI process show only the current session: user@host> show system users 5:39PM up 8 mins, 3 users, load averages: 0.27, 0.43, 0.26 USER TTY FROM user@ host pts/0 172.27.208.216 5:36PM - -cli (cli) <---- old telnet session lab pts/0 172.27.208.216 5:38PM - -cli (cli) <---- old telnet session lab pts/0 172.27.208.216 5:39PM - -cli (cli) <---- current telnet session user@host> show system connections |match 172.27.208.216 tcp4 0 0 172.27.116.36.23 172.27.208.216.63830 ESTABLISHED user@host> start shell % ps -aux |grep cli|grep -v grep lab 21016 0.0 0.2 786268 50304 0 S 5:39PM 0:00.15 -cli (cli) %. PR1247546
When the configuration statement set system ports console log-out-on-disconnect is enabled, the Junos OS eventd process (daemon) blocks the console-open(). However, during this stage with the syslog console configured (always logs on console), any logging continues even if the console session is ended. When the console logging continues to be in the waiting status, the eventd syslog rotation freezes and some processes that are directly involved in logging in to the system would also go into the wait status, causing undesirable behavior. PR1253544
Legacy Junos OS kernel might generate a core file on userland_sysctl / sysctl_root / sysctl_kern_proc_env / panic_on_watchdog_timeout. PR1254742
On Junos OS devices with legacy Free BSD (Free BSD version 6.X) based on Junos OS, the devices might crash and reboot if there is a defect in the Junos OS SDK-based multithreaded application that has been used. PR1259616
On M10i or M7i router with Junos OS Release 15.1, the Routing Engine goes to db> prompt after setting mirror-flash-on-disk. Traffic forwarding might be affected. PR1260268
For TX Serites or TXP Series systems, the kernel synchronization process (ksyncd) might restart on all LCCs after executing the command clear interfaces statistics all when there is a large SNMP polling interval. PR1274095
The kernel might fail to finish all input or output before shutdown during the upgrade. And the upgrade might not succeed with the following reason: Could not find installation package. PR1298749
Interfaces and Chassis
FPC might crash if the packet passed by PFEMAN to PPMAN has incorrect length. PR1195703
On MX240, MX480, and MX960, IPV6 neighborship is not created on the IRB interface. PR1198482
The show interfaces terse routing-instance all command has the wrong display format when there are multiple addresses. PR1207272
When OAM connectivity-fault-management (CFM) MEP is configured on the LSI or tunnel interface that is on the DPC card, every time a DMM (two-way frame delay measurement) or 1DM (one-way frame delay measurement) packet is received, certain harmless error messages might be seen. This occurs because software timestamping is not being used. The fix addresses the timestamp and suppresses the logs as well. PR1232352
Under a particular condition in configuring the interfaces that have vlan-id/vlan-tags configured, the commit operation might fail with an error message. PR1234050
On MX240, MX480, and MX960 platforms with 4X10GE DPCE card, if the interface is configured with he unidirectional option and you run the commit check command, the dcd process might be in high CPU usage (for example, 96 percent), which impacts the configuration checking. PR1236088
A T3 interface configured with compatibility-mode digital-link might fail to come up because of an incorrect subrate. PR1238395
In some rare situations, the Ethernet connectivity fault management daemon (cfmd) might crash when committing a configuration where the CFM filter refers to a firewall policy. When hitting this issue, all CFM-enabled interfaces are down. PR1246822
When using static demux VLAN interfaces, the link local address is not synchronized between the kernel and the subscriber management process (demon). When using router advertisement on a static VLAN demux interface and not in a IP dynamic profile, a router solicit from customer equipment might not be answered by the MX Series router. This depends on which address the CPE is using. In this PR the option to configure the MX Series router to use EUI-64 address for the demux VLAN ensures that the addresses are synchronized between the processes. PR1250313
The snmp-set command fails when the FPC, PIC, and port have a value greater than 9. PR1259155
Routing table entries are not cleared after bringing down static subscribers. Access routes are not cleared after subscribers log out. PR1260240
In a dual-stack PPPoE subscribers environment, when the PPP session has been in "OPEN" state, if the router receives a conf-request message from the client, it then sends a term-request message as a reply unexpectedly. PR1260829
The jpppd process might report error messages about RLIMIT_STACK and RLIMIT_SBSIZE after issuing the command of show version detail. PR1262629
In a subscriber scenario, when traceoptions is enabled with the flag GRES under PPPoE, if the subscriber username contains a format specifier (for example, the character "%") that cannot be successfully handled by the traceoption process, pppd might crash. PR1264000
Benign messages might be observed with configuration changes in an MX Series Virtual Chassis environment: Mar 2 00:14:30 CHASSISD_IPC_WRITE_ERR_NULL_ARGS: FRU has no connection arguments fru_send_msg Global FPC 14 Mar 2 00:14:30 SCC fru_set_boolean: send: set_boolean_cmd Global FPC 14 setting hold-pic-online-for-fabric-ready on. PR1264647
In a PPPoE scenario, subscribers might get disconnected due to a keepalive failure when CPE is adding an additional data field in the PPP echo request. PR1273083
By default, in Junos OS, the minimum length of the CHAP challenge is 16 bytes, and the maximum length is 32 bytes. Without using the configuration statement challenge-length minimum XX maximum XX, MX Series routers do not initialize the default chap-challenge-length, which according to our documentation should be a minimum of 16 bytes and a maximum of 32 bytes. PR1280263
When an Ethernet OAM LFM session is configured, the line card hosting the LFM session might reboot after the configuration is committed. PR1283280
When executing Routing Engine switchover, the dcd process checks the aggregated Ethernet (AE) interface. The check fails if the aggregated Ethernet interface has a member interface with “framing” settings. The failed check triggers both the aggregated Ethernet interface and its member interface to flap. PR1287547
With the affected release, if MPC was restarted followed by GRES, the jpppd process does not read the new service physical interface in a sequence. The new LNS subscriber login with this interface fails in the jpppd process. PR1290562
The family inet shows as Not configured after adding or deleting the loopback address. PR1294267
With this change, you can configure delay-buffer-rate on inline LSQ interfaces. PR1300281
If one logical interface changes the virtual router (VR) state from master to backup, traffic might get silently dropped and discarded for other logical interfaces that share the same group ID on an physical interface. PR1305327
In PPPoE subscriber management scenario, if subscriber authentication fails, the subscriber logical interface will be in disabled state. This causes the jpppd process to drop the next LCP termination request packet from the subscriber, instead of answering it with an LCP Ack and closing the PPPoE session with a PPPoE active discovery termination (PADT) packet that might impact session setup for this subscriber. PR1311113
An invalid configuration results because of the deficient dependency check of interface and interface-set. A disabled or deactivated interface included in an interface-set might get committed without any commit error. This issue might cause dcd to get into inconsistent state, and result in continuous crash of process dcd, chassisd, and mib2d after system reboot. PR1316976
There is no route to the IP address from the directly connected route on the static VLAN demux interface in case the configuration of the static VLAN demux interface is changed from unnumbered approach to the configuration with the explicit IP address (for example, /30). PR1318282
When running an MX Series router for BNG or subscriber management functionalities, the value shown in the dual-stacked subscriber IPv6 Framed Interface Id field (from show subscribers extensive output) is not matching the negotiated one. PR1321392
In PPPoE subscriber environment, continuous fault log messages might be seen on the backup Routing Engine. The issue does not have an impact on services. PR1328251
Multiple Virtual Router Redundancy Protocol (VRRP) groups are separately configured on different units of an aggregated Ethernet bundle, the unit 1 of which has both inner and outer VLAN configured. All the other VRRP groups might malfunction with a period of the time configured by failover-delay under VRRP stanza, after deleting aggregated Ethernet bundle unit 1. PR1329294
When the interface is configured as a member of interface-set, it might not work properly after an unrelated FPC (not the one where the interface resides at) restarts. The affected FPC is the restarted one. PR1329896
In some situations, like multiple commits in a short time with a scaled configuration, dcd memory leak might cause the commit to fail. PR1331185
When multiple VRRP sessions with the same group-id are configured on the same port (aggregated Ethernet interface or a physical interface), the VRRP virtual IP will be not reachable. PR1338277
J-Web
A remote, unauthenticated attacker might be able to execute through J-Web interface (CVE-2018-0001). Refer to https://kb.juniper.net/JSA10828 for more information. PR1269932
Layer 2 Ethernet Services
A new static MAC is configured under an aggregated Ethernet interface, but the MAC of the LACP PDUs sent out is not changed. PR1204895
On MX Series routers, if the chassis-level configuration is used to offline the FPC after detecting major errors, the FPC will be offlined. But if the committing configuration is performed after offlining the FPC, the FPC will be brought back online again. PR1218304
After changing the underlying physical interface (IFD) for a static VLAN demux interface, the NAS-Port-ID formed is based on the previous physical interface. PR1255377
In a large-scale unified ISSU testing, a MPC or FPC might go offline during the FRU upgrade phase of unified ISSU. PR1256940
The IPv4 and IPv6 packets originating from the Routing Engine might be corrupted when the bridge domain has vlan-id set to none, but the outgoing L2 interface for the packet is tagged and CoS is enabled. It only affects packets that originate from the Routing Engine but does not affect transit traffic. It affects both IPv4 and IPv6 packets. PR1263590
Delegated-IPv6-Prefix is not included in RADIUS accounting for PPPoE subscriber. PR1269062
On MX Series routers, BNG is configured as DHCPv6 local server for IPv6 prefix delegation alone when a DHCPV6 client bound to IA_PD prefix sends a request for IA_NA prefix. MX Series routers respond with a REPLY message with STATUS_NO_ADDR_AVAIL, which is correct, but it deletes the existing binding for PD prefix, which is not an expected behavior. PR1286359
On MX240, MX480, and MX960 platforms, due to a resources contention during multiple commit processes, the kernels might display I2C bus errors. PR1298612
MX Series router deployed as BNG for dual-stack DHCP or PPPoE subscriber management might eventually stop logging in new subscribers in case DHCP configuration is incorrect (for example, IPv6 address pool is defined too small), because of incorrect calculation of in-flight connections. PR1298976
After rebooting the router or after smg-service is restarted, DHCPv6 packets get dropped when a no-snoop configuration is used. The issue is observed in a setting where subscribers connect over a static VLAN demux interface. PR1316274
Layer 2 Features
A Junos OS device with VPLS routing-instances configured on one or more interfaces might be susceptible to an mbuf leak when processing a specific MPLS packet. Refer to https://kb.juniper.net/JSA10855 for more information. PR1272898
In a scaling VPLS scenario, convergence is taking more than 10 minutes (it is expected to take 20 seconds). Also, in VPLS topologies the kernel might report the error pointchange for TLV type 00000052 not supported on IFL <name> in
/var/log/messages
where <name> is a VT or LSI interface used by VPLS. Sometimes the issue can be reproduced by simply loading the configuration if the scale is high enough, but other triggers might apply as well. PR1279192In a virtual private LAN service (VPLS) scenario, any changes in VPLS configuration like deleting or re-adding VPLS instances or deleting or re-adding VPLS interfaces might cause the rpd process memory leak. The memory leak rate is 14 bytes per VPLS interface. PR1335914
MPLS
The rpd process might crash while restarting the interface control with LDP configured. PR1130494
The routing protocol process (rpd) might crash in the backup Routing Engine when LSP tunnels are present with an NSR configuration. PR1186292
With label distribution protocol (LDP) enabled, the deletion of an LDP entry (for example, LDP interface down) might cause many LDP entries to be deleted, which might result in routing protocol process (rpd) crash. PR1221766
Junos OS supports protocols MPLS in the VRF routing instance, but Junos OS does not support protocols connections (CCC) inside the VRF routing instance. However, when any interface under MPLS inside VRF routing-instance is configured and added, then it affects protocols connections (CCC) inside master, main, and default Instance. For instances, if any CE facing interface under MPLS protocols in any VRF routing-instance is configured and added, it is deleting the data structure containing CCC information as Junos OS does not have CCC information inside the VRF routing-instance. PR1222570
In an MPLS OAM environment, a rare timing condition might result in an rpd crash when a memory clean task is delayed. PR1233042
The routing protocol process (rpd) might stop running unexpectedly if a static MPLS LSP is moved from one routing instance to another routing instance in one single configuration change with one single commit. The rpd need to be manually restarted using restart routing command. PR1238698
In an MPLS environment, when a non-master routing instance with label switch path (LSP) is deleted and re-added, the rpd process might crash. The routing protocols are impacted and traffic disruption is seen due to loss of routing information. PR1241631
The ldp traffic-statistics configuration does not work appropriately for ACX Series, EX Series, M Series, MX Series, PTX Series, QFX Series, SRX Series, and T Series devices with Junos OS Release 16.1R4. The defect not only prevents periodical LDP statistic gathering but also causes kernel memory leak. Kernel memory leaks might lead to various side issues. PR1258308
In label distribution protocol (LDP) environment with traffic statistics configured, if there are failures with LDP traffic statistics collection, there might be a routing protocol process (rpd) crash during LDP shutdown. This is a timing issue. PR1264305
The routing protocol process crash might be seen if egress-policy is configured in LDP and the same route prefixes are in both inet.0 and inet.3. PR1266358
With LDP session-protection configured, the LDP session for the remote LDP peer for rLFA (remote loop free alternate) might still remain up, even after rLFA is disabled or after the remote targeted LDP session is no longer needed by rLFA. PR1266802
When MPLS builds the next hop for an mpls.0 route for the scenario with IDP over RSVP LSP over bypass tunnel and the IDP label is implicit-NULL, the label stack constructed for the next hop might be incorrect, with an invalid bottom label value of 1048575. PR1270877
At the transit node of a P2MP tunnel, the changes to the reservation state of a sub-LSP might inadvertently cause the reservation state of other sub-LSPs in the same session to skip refresh cycles, which might result in the reservation tears being sent upstream. Flapping of one sub-LSP might cause other sub-LSPs in the same P2MP session to be torn down. PR1272223
During LDP shutdown, a route added and deleted by LDP in the inet.0 table might be in the process of being deleted but still be in the inet.0 table. The show route extensive CLI command might cause the rpd to crash when trying to display the task name for such an LDP route. PR1272993
In an L2 circuit scenario, while processing an advertisement of LDP signaled L2 circuit, it gets stale binded because of the corrupted LDP structure. As a result, the rpd crashes. The reason for this corruption is not found and this issue is not reproduced. PR1275766
A crafted MPLS packet might lead to a kernel crash (CVE-2018-0003). Please refer to https://kb.juniper.net/JSA10831 for more information. PR1276786
The following log messages might be seen when you have an output firewall filter attached to the loopback interface: >>>>>> kernel: in_dfw_match: invalid IP version 1. This is caused by the incorrect parsing of MPLS l2ckt ping packets. The logs are completely harmless, and it does not mean that any packets have been discarded. PR1288829
The minimum maximum transmission unit (MTU) supported for MTU signaling in resource reservation protocol (RSVP) is 1488 bytes. If the ingress router of an LSP receives MTU less than 1488, it uses 1488 instead. PR1291533
When performing traceroute to a remote host for an MPLS path signaled by the LDP, the rpd process might crash. PR1299026
In rare conditions, where traffic engineering is configured and there are more than 4 addresses configured for the loopback interface, rpd process might crash when there are multiple interior gateway protocol (IGP) flaps. PR1303239
If BGP multipath is configured, and when the interface associated with one of the equal cost paths flaps and eventually comes up within BGP hold-time, the prefixes might be installed in the routing table only with the path corresponding to the flapping interface as the next hop. PR1305228
In some cases, it is seen that the label states are getting deleted twice, which results in routing protocol process (rpd) crash. This is applicable only when ultimate-hop popping (UHP) based label-switched paths (LSPs) are configured. PR1309397
When LDP egress-policy is configured for the BGP route and a label is received for a BGP route in inet.0 table from LDP, if BGP receives a new label for the same BGP route matching the LDP egress-policy, rpd might crash because of updating the new label. PR1312117
The show mpls container-lsp output does not show any egress LSP until the enhanced FRR is enabled for these egress LSPs. PR1314960
With the deployment of l2circuit on MX Series Virtual Chassis (MX-VC) and aggregated Ethernet interface works as core-facing interface whose member interfaces are across Virtual Chassis members (VC members), if the IPv4 and IPv6 multicast traffic comes in through l2circuit and goes out through aggregated Ethernet member interface across Virtual Chassis members, the traffic might get dropped on egress Packet Forwarding Engine (PFE). The egress Packet Forwarding Engine on other Virtual Chassis member recalculates the hash value after the new layer2 header and MPLS label is pushed, which results in a different hash value from the one calculated by ingress Packet Forwarding Engine, thus causing packets drop. PR1320742
For non-auto-bw LSPs, in a scenario where the some of routes resolving over the RSVP LSPs are withdrawn, the SNMP O.I.D counters for mplsLspInfoAggrOctets might show constant value for RSVP LSPs for a longer time (for more than a few cycles of the statistics sampling interval) and do not match the statistics of show mpls lsp statistics during that time. PR1327350
Packet loss might be observed when auto-bandwidth is enabled for circuit cross-connect (CCC) connections and label-switched-path (LSP) no-self-ping with no-install-to-address is configured. PR1328129
When there is an error during the creation of the RSVP path state (the PSB data structure), the data structure itself is freed but some associated memory is not freed. This causes a memory leak. This error condition occurs on a NSR master Routing Engine (or when no NSR is configured). But on the NSR backup Routing Engine, it is more likely to have conditions that cause the path state creation to fail, thus exposing the memory leak in the error handling code. Thus, this memory leak was seen on the NSR backup Routing Engine. PR1328974
When LDP processed BGP route for setting up an LDP to BGP stitching route, it might unnecessarily repeat updating the same route multiple times. This might increase the convergence time and result in additional traffic loss. PR1334764
Whenever there is a decrease in the statistics value across an LSP, the mplsLspInfoAggrOctets value takes two statistics intervals to get updated. The mplsLspInfoAggrOctets value holds the same value for two statistics intervals (including the statistics interval at which there was a decrease in statistics) and gets incremented from the next statistics interval onward. PR1342486
Multicast
Multicast traffic is not forwarded on the newly added P2MP branch or receiver due to multicast indirect next hop and alternate forwarding next hop (snooping route) goes out of synchronization when the receiver is leaving the group. PR1317542
Network Management and Monitoring
If max-events-queued is not configured, eventd process might crash when generating large amount of logging messages. PR1155756
Traps are sent as AgentX messages type (AGENTX_MSG_NOTIFY) from the subagent to the master agent. The subagent expects a response in form of an acknowledgment from snmpd after sending these AGENTX_MSG_NOTIFY messages upstream. If an ACK is not received from snmpd within 1 second (current timeout value) the subagent will resend the trap. After router reboot or GRES, a lot of upstream communication is triggered from the subagent to snmpd (traps or MIB registration messages). During this time, snmpd might not be able to send the downstream ACK within the 1 second period. This might trigger the subagent to resend the trap, which will be seen as a duplicate trap on the NMS. As a fix, we have increased the timeout value from 1 second to 5 seconds in the subagent. PR1164848
The CLI command snmp notify-filter is configured with wildcard characters for the following output. Example configuration: set snmp v3 notify-filter nf1 oid .1.*.6 include set snmp v3 notify-filter nf1 oid 1.2.3.4.5 mask 1.0.0.1.1 set snmp v3 notify-filter nf1 oid 1.2.3.4.5 include Before the fix: > show snmp v3 notify filter Filter Subtree Filter Storage Status name type type nf1 1.2.3.4.5 include nonvolatile active <<<< Here, 1.0.0.1.1 mask is not applied nf1 1.42.6 include nonvolatile active <<<< Here 1.*.6 is considered as 1.42.6. (Where 42 is the ASCII equivalent of wildcard "*") After the fix: > show snmp v3 notify filter Filter Subtree Filter Storage Status name type type nf1 1.*.*.4.5 include nonvolatile active <<<< Mask is applied correctly nf1 1.*.6 include nonvolatile active <<<< Wildcard "*" is treated as expected PR1185143
The statistics of OID ifOutError incorrectly includes ifOutDiscards. The buffer overruns are counted under ifOutErrors along with ifOutDiscards when SNMP Query is performed on ifOutErrors. PR1243071
One Routing Engine in an SRX Series cluster does not reply to an SNMP request. Either the primary node or the secondary node could be the target. PR1240178
A vulnerability in Junos OS SNMP MIB-II subagent daemon (mib2d) might allow a remote network based attacker to cause the mib2d process to crash resulting in a denial of service condition (DoS) for the SNMP subsystem. PR1241134
The command Esc-q does not work to toggle the console log or terminal log. The issue is seen on FreeBSD10 builds from Junos OS Release 15.1 and later. PR1269274
On Junos OS devices with SNMP enabled, a network-based attacker with unfiltered access to the Routing Engine might cause the Junos OS snmpd process (daemon) to crash and restart by sending a crafted SNMP packet. Repeated crashes of snmpd process might result in a partial denial-of-service condition. Additionally, it might be possible to craft a malicious SNMP packet in a way that might result in remote code execution. Refer to https://kb.juniper.net/JSA10793 for more information. PR1282772
The show arp no-resolve interface <interface-name> command is showing unrelated static ARP entries, which are fixed to display proper static ARP entries of the given interface. PR1299619
When an SNMP MIB view is attached to a community, the default views of "_all_" and "_none_" get added to the view linklist on each snmpd configuration update (SIGHUP) signal. This list can grow long and it causes the queries to loop through all the entries during view-based access control model (VACM) checks. This causes the CPU hike during SNMP query. PR1300016
With interface-mib configuration in dynamic-profile, when multiple OIDs are queried in a SNMP GET or SNMP WALK, the router might reply with No Such Instance currently exists at this OID for some of the OIDs. PR1329749
Platform and Infrastructure
Under a large-scale setup, VPLS MAC might not be aged out from remote Packet Forwarding Engine when Packet Forwarding Engine is MPC3, MPC4, MPC3E, and MPC4E, then unknown-unicast frames flood will be seen on the local Packet Forwarding Engine. PR1099253
Configuring a parameter of "broadcast 255.255.255.255" to an interface family inet when executing the commands show arp or clear arp causes a kernel crash. This issue might cause route flap, which impacts traffic. PR1120114
On ungraceful exit of telnet (quit or shell logout), perm and env files are not deleted. PR1142436
With the fix, XM-DDR3 boot diagnostics returns the test result of all XM-DDR3 components to the XM driver. If any XM-DDR3 component fails in the boot diagnostics test, the XM driver aborts the XM chip init process and reports hardware failure. The line card will not be brought online with any XM-DDR3 fail, causing a potential risk when sending corrupted packets to the remote Packet Forwarding Engines through the fabric streams. PR1166106
Internal fabric header corruption on Packet Forwarding Engines (on MX Series with MPCs or FPCs chipsets) can lead to packet corruption on the egress Packet Forwarding Engines. This PR effort is to protect the fabric header coming to the egress Packet Forwarding Engines with a fabric CRC check. This is shown to avoid wedges due to corrupted fabric headers. PR1170527
With the MAC accounting feature ethernet-switch-profile mac-learn-enable configured on an interface of MX Series based FPC, the limit of MAC database might be reached and the FPC crashes. PR1173530
On MX Series vMX platform, one firewall filter is configured with an action of policer and applied to aggregated Ethernet logical interface. Adding and removing this filter from aggregated Ethernet logical interface might not cause packets to be dropped. PR1176381
The issue occurs because of the access to a stale or invalid pointer that caused a particular check based on the pointer structure field to unpredictably fail, resulting in the assert later in the code. The issue occurs when a sequence of events related to firewall filters resulted in filter structure getting deleted and re-created again. PR1205325
After configuring a custom ARP policer after a reboot or vlan-id changes over a logical interface, the ARP policer configuration is overwritten by the default ARP policer. PR1210178
Routing protocol process (rpd) might restart unexpectedly after continuous flapping of the BGP connections. PR1221183
The error messages about LUCHIP(5) GUMEM1[77a0] mismatch might be seen after MX Series MPC card with LU chipset goes offline or online. PR1221195
Under certain conditions sync-other-re editing configuration warning might be displayed after reboot: user@host> configure exclusive warning: uncommitted changes will be discarded on exit entering configuration mode users currently editing the configuration: sync-other-re (pid 9220) on since 2016-10-03 00:16:36 PDT, idle 2d 05:47 sync-other-re (pid 9282) on since 2016-10-03 00:16:40 PDT, idle 2d 05:47 sync-other-re (pid 9333) on since 2016-10-03 00:16:49 PDT, idle 2d 05:47 sync-other-re (pid 9383) on since 2016-10-03 00:16:59 PDT, idle 2d 05:46 sync-other-re (pid 9433) on since 2016-10-03 00:17:07 PDT, idle 2d 05:46. PR1221723
Incorrect firewall filter to interface mapping might be observed after performing an upgrade to the affected release (Junos OS Release 15.1R4-S7,15.1R5-S2,15.1F2-S15,15.1F7,16.1R4,16.2R1-S3,16.2R2 and later releases), and then performing a GRES-disabled Routing Engine switchover. PR1224995
Next hop used for Routing Engine generated TCP traffic might differ from the one used for Routing Engine generated non-TCP traffic if the prefix is not subjected to 'then load-balance per-packet' action and is pointing to an indirect next hop resolved through unilist next hop (ECMP). Before the fix for PR1193697 this leads to non-TCP traffic generated from Routing Engine taking one unicast next hop while TCP traffic generated from Routing Engine is load balanced across different next hops. After the fix for PR1193697 this behavior might lead to non-TCP host outbound traffic taking one unicast next hop, while TCP host outbound traffic takes another. PR1229409
High MPC5 CPU on a scaled setup with 64,000 to 128,000 subscribers due to XQ background service that collects internal statistics. PR1233452
On MX Series routers with MPC5, MPC7, MPC8, and MPC9, when a low value of temporal buffer size (for example,10,000) is configured, the threshold in the drop rule in the Packet Forwarding Engine (PFE) differs from what is expected. PR1240756
The large scale of routes (for example: 900K), GRES and NSR, unified ISSU might fail. The master Routing Engine upgrades to a new software, but unified ISSU is aborted before GRES. PR1240788
With commit script configured, the management process (mgd) might crash when you configure anything in the private configuration mode. The problem is specific to private configuration mode edit private. It is not seen in regular configuration mode [edit] and if there is no commit script configured. PR1244015
When RADIUS accounting is configured, the Junos OS device tries the maximum number of times sending RADIUS accounting requests to a non-reachable RADIUS accounting server. When sending the request for the last time, the socket is closed because of the network down between Junos OS device and RADIUS accounting server, and the auditd might crash. Auditd process gets restarted automatically after the crash. Accounting continues to work after auditd crash. However, at the time of crash if there are some messages in the auditd queue that need to be sent out from Junos OS device to accounting server, those messages might get lost. After auditd gets restarted, the next event that has to be sent to RADIUS server will be sent normally. PR1250525
On rare occasions during the route add, delete, and change operation, the kernel might encounter a crash with the error rn_clone_unwire no ifclone parent. PR1253362
In a logical systems environment, if there are some failures that cause Routing Engine switchover (not performing Routing Engine switchover manually), the kernel routing table (KRT) queue might get stuck on the new master Routing Engine with the error ENOENT -- Item not found. PR1254980
Packets are not encapsulated with GRE header after disabling and reenabling the gr-interface, and GRE tunnel traffic might get dropped. PR1255706
On Junos OS Releases 15.1 or 15.1F with VRRP and PIM configured, when a router's VRRP mastership switched from master to backup, the router continued to use VRRP virtual MAC address (vMAC) for source address of all Ethernet packets leaving its interface. Network might be unstable and traffic might be affected because frames with the same MAC address would be received from different points. PR1257477
During unified ISSU, memory from the previous image related to hash tables is not properly recycled, which leads to physical memory block being left unused. The crash is triggered by an attempt to create a memory pool using one of these blocks. PR1258795
When a DHCP and BOOTP reply packet is received from an unnumbered interface, the FUD process might fail. PR1260623
After a unified ISSU upgrade, the WRED drop profile might not be programmed correctly, resulting in an incorrect WRED drop. PR1260951
Error handling actions and an alarm when a DDRIF memory checksum error situation is detected on an MQ chip-based MPC have been added. Without this change, the system only reports such errors but does not take any action. PR1260983
The error message rnh_iff_delete_nh: no pat-node that might be seen when subscriber logs out is innocuous and its severity is reduced to debug in the releases with the fix. PR1263983
Due to the transient hardware events, fabric stream might report CPQ1: Queue underrun indication - Queue <q> continuously. For such events, all fabric traffic is queued for the Packet Forwarding Engine reporting the error, resulting in a high amount of fabric drops. PR1265385
In Junos OS, when a new line card or a service card comes online, the real-time performance monitoring (rpm) process might receive the following error message: GENCFG: op 9 (RPM Blob) failed; err 1 (Unknown). PR1266336
MX Series routers with FPCs might crash generating a core file when interface specific firewall filters are configured with policers. PR1267908
On all platforms, fast flapping of interfaces or fast changing of configurations might cause an rpd crash and BGP sessions flap quickly. PR1269116
In rare cases, the Packet Forwarding Engine might drop the TCP RST (reset) packet from the Routing Engine side while doing GRES or flapping an interface, and traffic might be dropped. PR1269202
On MX Series with MPCs or FPCs-based platform when the total quantity of QoS enabled objects is few, the bps rate of queue statistics is sometime showing more than 100 percent than the actual pumped traffic. PR1271055
On MX Series routers with MPC line cards, if the IRB index gets an invalid value because of an unknown reason and the IRB interface is deleted or any configuration change is made for this IRB interface, an MPC crash might be seen. PR1281107
In a dual Routing Engines (RE) scenario, if one Routing Engine is running a release with image named jinstall-** (Junos OS Release 15.1 and prior releases are jinstall) and the other Routing Engine is running a release with image named junos-**, a password might be required when logging from the Routing Engine with jinstall image to another with Junos OS image using CLI command request routing-egine login other-routing-engine. The issue leads to the inability of transferring files between Routing Engine or performing a synchronized commit. PR1283430
From Junos OS Release 15.1, if aggregated Ethernet interfaces with child legs are anchored on an MQ-based MPC without queuing chip (that is MPC(E)-3D-16XGE-SFPP, MPC1(E)/MPC2(E) without Q on MX Series platform, and EX9200-40T, EX9200-40F, EX9200-40F-M on EX9200), the aggregated Ethernet bundle might operate in the restricted queue mode because of an incorrect code. The restricted mode results in the upper queue numbers (#4 - #7) being mapped back up to queues (#0 - #3). So the traffic that is destined to queue #4 might be actually sent out on queue #0 and so on. PR1284264
In Junos OS Releases 14.2, 15.1, and 16.2, split horizon feature for L2 packets is broken while enhancing some other features. As part of this PR, a split-horizon check was added to discard the packets going out on the same interface on which they were received. PR1286193
The issue occurs on an MX Series router installed with both MS-DPC and data MPC cards, the network service is configured in enhanced-IP mode, and the ae interface is configured on the MPC card. If the member interfaces of the ae interface are under a different Packet Forwarding Engine, the outbound traffic from the ae interface might experience incorrect load balancing. If the traffic is received from MS-DPC and exits from the ae interface on MPC, the egress traffic is transmitted to only one member interface of the ae interface instead of all. PR1287086
The show system resource-monitor fpc slot <> reported 'mem free' percentages that were not accurate. Earlier generations of FPC used EDMEM only for next hop /FW; later generations of FPC can expand into DMEM. This PR takes into account these differences and ensures the next hop /FW memory free % values are correct. PR1287592
If the next-hop address defined in the 'forwarding-options next-hop-group' is reachable through multiple interfaces, there might be a memory leak on MX Series with MPCs or FPCs based card when the ARP entry for this next-hop address changes from one interface to another interface. PR1287870
In MX Series with MPCs or FPCs-based MPC scenario, if aggregated Ethernet has more than one child link hosted on different Packet Forwarding Engines, and the previous device load-balanced the stream (based on L3 or L4 fields) to multiple links of the aggregated Ethernet, due to a software defect, the source media access control (MAC) address learned from cross Packet Forwarding Engine aggregated Ethernet might keep bouncing between aggregated Ethernet member Packet Forwarding Engines for a long or infinite time and might cause MLP-ADD storm. PR1290516
When the RPM http-get feature is running, the rmopd process gets stuck at sbwait state if the HTTP agent does not respond properly. PR1292151
On MX Series routers running the subscriber management feature, the scale subscriber license might not be cleaned up after bulk subscribers log out. When the number exceeds the license limitation and once the Routing Engine becomes the master, no new subscriber can be logged in. PR1294104
Traffic can get dropped in egress Packet Forwarding Engine due to hashing mismatch between ingress and egress Packet Forwarding Engine when IRB over aggregated Ethernet is configured in VPLS scenario. PR1300789
On MX Series platform with firewall filter configuration, MPC reset might cause Packet Forwarding Engine (PFE) crash for packet buffer error (which is full). PR1300990
When the total number of available CoS queues on an MPC Type 1 or Type 2 with an enhanced queuing chip (QX chip) is limited with the chassis fpc max-queues configuration, some interfaces might start dropping all traffic as Tail-/RED-drops. PR1301717
The Type-P Descriptor format of the TWAMP Request-TW-Session message is not RFC compliant. PR1305752
On MX Series router MPC3 or MPC4, when the fabric header protection feature is enabled, the DRD parcel timeout errors might be seen. PR1320874
Starting from Junos OS Release 14.2R1, the no-propagate-ttl might not take effect if chained-composite-next-hop ingress l3vpn extended-space is configured. The TTL value is still copied from the IP header to MPLS labels in the stack even though no-propagate-ttl is configured. PR1323160
On a multichassis system (TX, TXP, TXP 3D) with four LCCs, if more than 8 100G FPCs are configured with non-default forwarding-mode, the SFC's chassisd bounces PIC on LCC0-FPC0 at every chassisd's configuration change commit. PR1324745
The MAC address might not be learned on MX Series with MPCs or FPCs-based card because of the negative value of the bridge MAC table limit counter. PR1327723
If the commit script generates an invalid configuration and corrects the same by deleting the configuration and after a commit with synchronize configured, the patch might generate 0 bytes size, instead of actual diff. Jan 9 10:49:43 re0-abc mgd[3672]: UI_CMDLINE_READ_LINE: User 'netops', command 'commit synchronize force ' Jan 9 10:50:16 re0-abc mgd[3672]: UI_CFG_AUDIT_OTHER: User 'root' delete: [class-of-service interfaces xe-2/1/0] Jan 9 10:50:23 re0-abc mgd[3672]: UI_COMMIT_PROGRESS:Commit operation in progress:filename
/var/run/db/juniper.db-patch.sync, size 0 <<<<
this message indicates no change in configuration; however, there is a configuration change. PR1329513Libpcap did not have support for PS and LT interfaces for Junos OS Release 16.2 and earlier branches. For Junos OS Release 17.1 and later, libpcap did not have support for LT interfaces. PR1329665
If the response is not received from the RPM server, pingResultsMinRtt, pingResultsMaxRtt, pingResultsAverageRtt, and pingProbeHistoryResponse are marked as "1" instead of "0". This defective value was set while converting the microseconds to milliseconds. Through this fix, when the 0 < RTT <=1 milliseconds, it is displayed as "1" in SNMP queries and if there is no response, it is marked as "0" as per RFC 2925. PR1333320
When an MPLS unicast next hop gets removed (for example, due to a network convergence event), the statistics associated with that next hop can be erroneously added to the output statistics of the respective physical interface, causing false abrupt increments in output packet and byte count. Logical interface statistics and CoS queues' counters are not affected by this problem and still provide accurate data. PR1338581
IPv4 GRPS traffic over an aggregated Ethernet interface might be affected if enhanced hash key gtp-tunnel-endpoint-identifier is configured. PR1347435
Routing Policy and Firewall Filters
On all Junos OS platforms with vrf-target auto configured under routing-instance, the rpd might crash after an unrelated configuration change. PR1301721
If any part of the policy (ssm-map-policy or group-policy or oif-map) is changed under an IGMP interface, committing the configuration might fail. This is because of the deficient computing method for the total characters of policy under an IGMP interface. This causes the calculation result to exceed the limit. PR1327075
Routing Protocols
For FEC 129 VPLS (also known as LDP VPLS with BGP-based autodiscovery), if abandoned VRF and VPLS instances are left after all of the other pieces of configuration are removed, and the BGP protocol is deactivated in the master instance, the rpd process might crash continuously when a new configuration is commited. PR1006689
The monitor component in sshd in OpenSSH before 7.0 on non-OpenBSD platforms accepts extraneous username data in MONITOR_REQ_PAM_INIT_CTX requests, which allows local users to conduct impersonation attacks by leveraging any SSH login access in conjunction with control of the sshd uid to send a crafted MONITOR_REQ_PWNAM request, related to monitor.c and monitor_wrap.c (CVE-2015-6563). PR1116227
When applying add-path prefix-policy to neighbor level, all neighbors are separated into different update groups. This is not the expected behavior. There is no service impact. But, if all the neighbors are configured under one peer group with a huge number of peer groups, the scaling and performance might go down. PR1137501
BGP session flaps while changing add-path configuration at the group level for a family that is not configured at the neighbor level. PR1173773
FPC crashes when show ppm info is executed. PR1174977
The rpd might crash when a new PIM interface with the same SNMP index and name as the one that already exists is added to the SNMP index tree. The fix allows the new PIM interface to be added by removing the old one from both the name and SNMP index tree. PR1178589
In a dual Routing Engines scenario, if OSPF protocol is configured with MD5 authentication, after Routing Engine switching, the OSPF session might flap for authentication failure. PR1198179
In large-scale BGP route environments with multipath configured, if BGP sessions go down simultaneously, the rpd might crash because it cannot finish multipath cleanup within a 10 minute limit. PR1209695
When IS-IS is configured with overload timeout of 60 seconds and fragmented LSPs exist (for example, 25 IS-IS neighbors + 10K IPv4 routes + 1K IPv6 routes), if the link flaps or the neighbor down or restart routing event is triggered, the IS-IS routes might be missed in the routing table, which might cause some protocol sessions to go down and traffic loss. PR1213166
On Juniper Networks devices with BGP flowspec and graceful restart for BGP configured, after the Routing Engine switchover, the firewall filter __flowspec_default_inet__ might be missed, causing BGP flowspec to not work correctly. PR1213227
With the existing code, the default MoFRR behavior is sticky for both ECMP and non-ECMP cases. With the sticky option, when the active link goes down, the active path selection gives preference to backup path to get transitioned. The active path does not follow a unicast selected gateway. While this behavior works perfectly fine for ECMP cases, it leads to an issue for non-ECMP cases, where MoFRR can choose an LFA path (which is a unicast backup) to become an MoFRR active path. This results in failure of multicast forwarding. New MoFRR behavior: The expected behavior is that - "MoFRR should never choose a unicast LFA path to become an MoFRR active path. The LFA path can only be selected to become a MoFRR backup." In order to rectify the mentioned issue, the default MoFRR behavior for non-ECMP cases changes to non-sticky (with non-sticky option, the selection of active path strictly follows unicast selected gateway). With this fix, while selecting the MoFRR active path, the LFA path is never selected. Also, in case the active link goes down, this fix will avoid LFA backup to transition to active. In such a case, unicast selected path transition becomes active. Note: For ECMP, the default MoFRR behavior remains Sticky. PR1217350
The routing protocol process (rpd) on a backup Routing Engine might restart unexpectedly in a large BGP NLRI environment. PR1220651
In the rare scenario with a maximum number of routes in the BGP RIB_OUT table (for example, there are more than 700K BGP routes in route table), the rpd process might crash after performing BGP flapping. PR1222554
On all platforms, if MPLS goes down due to link flap, FPC reboot, or restart, rpd core files could be seen. PR1228388
The Junos OS OpenSSH memory exhaustion is seen because of the unregistered KEXINIT handler (CVE-2016-8858); Refer to https://kb.juniper.net/JSA10837 for more information. PR1228873
In a rare condition after a BGP session flaps, BGP updates might not be sent completely, resulting in BGP routes being shown in the advertising-protocol table on the local end but not shown in the receive protocol table on the remote end. PR1231707
In a PIM scenario with BSR configured, after deleting a static rendezvous point (RP) configuration from another router, then checking an RP table on a BSR router, there might be a stale bootstrap RP entry (which is the static RP deleted from another router) in the RP table. PR1241835
Session uptime in show bfd session detail output omits seconds if uptime is longer than 24 hours, which is different from similar output for LDP, OSPF, or BGP. Seconds are always included into the corresponding outputs for these protocols. PR1245105
If the same multicast group is also the member of different bridge domains with different interface routing and bridge (IRB) interfaces (for example, IRB1 with multicast group 1 in bridge domain 1, and IRB2 with multicast group 1 in bridge domains 2), when one of the receivers leaves the multicast group and the IRB interface is disabled and then enabled, multicast traffic for the remaining receivers in the group might get lost. PR1245297
When the advertise-from-main-vpn-tables configuration statement is used under BGP and the route reflector (RR) functionality is added, a refresh message is not sent, resulting in some missing routes. PR1254066
The rpd might crash in subscriber management deployment while adding a static route with the qualified next hop set to an assigned subscriber address. PR1262261
When the policy with damping is applied on BGP, the rpd might crash after deactivating or activating protocol BGP, which can result in protocol flap or traffic drop. PR1272202
During a unidirectional BFD failure, when BGP graceful-restart (GR) helper mode kicks in, stale routes are not getting removed and this causes traffic to be silently dropped or discarded. PR1276497
In a BGP configuration scenario, the following log entry might be seen in the messages log under normal operation and should be ignored: rpd[11156]: %DAEMON-3: bgp_rt_send_msg_attr: too big attributes: avail 123. PR1276758
The rpd process generates a core file due to BGP UPDATE with malformed optional transitive attributes (CVE-2017-10618). Refer to https://kb.juniper.net/JSA10820 for more information. PR1279204
In a BGP scenario with NSR configured, after GRES, when sending or receiving bgp-updates, while flapping several peers, the CPU utilization of rpd might stay at 100 percent for about 2 hours. PR1280583
In a BGP label unicast protection scenario with the statement per-prefix-label configured, rpd might crash because of a certain chain of events. If a BGP route with the indirect next hop is received first and later another BGP route with the direct next hop (which has the same prefix as the route received earlier) is received, then the prefix is advertised at least on the group. PR1282672
In a PIM sparse mode scenario, the second multicast packet sent by a multicast source might be discarded on the RP router. The first packet and third packet onward can be honored by the RP router. PR1282848
The rpd might crash if dynamic rendezvous point (RP) goes down in the topology with equal-cost multipath (ECMP) to RP and protocol independent multicast (PIM) join-load-balance automatic statement is configured. PR1288316
BGP-RR sends full route updates to its RR clients when any of the interfaces with the family-mpls interface bounce because of any fiber cut or manual events, causing high CPU spike. This happens when the process generates outbound soft-route-refresh through route update messages to the network peers. PR1291079
Multiple multihop BFD sessions to a common destination address are flapping on MX Series router. PR1291340
If a router works as a graceful restart helper during a peering establishment, the newly established peer might lose some of the negotiated capabilities and might interpret the updates incorrectly. This might cause peer drops or invalid routes. PR1293174
If LACP, link fault management (LFM), CFM, or STP is configured, the unified ISSU might take more time to complete and the FPC might go offline. PR1298259
MSDP sessions might flap because data replication gets stuck between the backup and the master Routing Engine with a huge SA burst between peers. PR1298609
The rpd might crash due to malformed BGP UPDATE packet (CVE-2018-0020). Refer to https://kb.juniper.net/JSA10848 for more information. PR1299199
With BGP Prefix-Independent Convergence (PIC) enabled, the routing protocol process (rpd) might crash, generating a core file while deleting a multipath route. PR1302395
On Junos OS Release 16.1 and prior releases with BGP, prefix-independent convergence (PIC) and the RIB import feature enabled, if the intermediate IS-IS primary route is deleted, the rpd process might crash and a core file might be generated. This could cause routing protocols to restart. PR1303327
When BGP family inet labeled-unicast protection is configured, a BGP bypass route might be installed in inet.2. At the same time, if inet.2 is used as the RPF table, the bypass route might be used to perform an RPF check, which leads to an RPF check failure. PR1310036
In an IS-IS and IPv6 scenario, rpd might crash when the neighbor router is restarted, causing route churn. PR1312325
BGP route age was getting reset when after the inactive route or path flap. PR1312538
IS-IS SPF gets triggered by LSP updates containing changes in reservable bandwith in TE extensions. PR1313147
When Junos OS interworks with other vendors' device, the primary path of MPLS LSP might switch to other address even though strict is configured for primary path. PR1316861
In some circumstances, a route from a BGP peer in a VRF might have an incorrect multipath attribute. PR1317623
In a Layer 3 VPN scenario with maximum-prefixes and vrf-import or vrf-export configured, when the limit for maximum-prefixes is reached, increasing maximum-prefixes might not take effect immediately. The reason is that if vrf-import or export policies are present, Junos OS does not reapply the import policy in this situation. PR1323765
When route target filtering (RTF) is configured for Virtual Private Network (VPN) routes and multiple BGP sessions flap, there is a slight chance that some of the peers might not receive the VPN routes after the flapped sessions come up. PR1325481
Multiple next hops might not be installed for an internal BGP (IBGP) route received from a multipath-enabled peer when an active IBGP route from a non-multipath-enabled peer is changed to a new active route from a multipath-enabled peer because of interior gateway protocol (IGP) route update. PR1327904
A flag needed to update BGP about a change was reset leading to no further updates when the underlying LSP next hop changes. A dead next-hop type for an interface that has flapped (or the FPC reset) might be observed. This only impacts the cloned route (S=0). PR1333570
In LI IGMP joins are not processed with the passive allow-receive statement configured on the IGMP interface. In pre-LI, IGMP joins were processed and accepted with the passive allow-receive configuration. However, the timer to send the query was not started. Hence after the configured time (default is 260 seconds), the multicast group joined through IGMP join was deleted. PR1334913
Core files are seen with next-hop list. A minor update to align the labeling has been done. PR1342481
The routing protocol process (rpd) crashes while PIM is unable to identify the next-hop gateway address. PR1348550
Services Applications
If L2TP is configured under the [access-group] hierarchy, during commit or commit check operation, the pppd process might crash (the configuration could commit successfully). It might result in minimal system impact and it restores automatically. PR1108024
On MX240, MX480, MX960, and MX2000 Series in L2TP scenario, perform GRES while subscribers are connected and then disconnect the subscribers. Stale L2TP tunnel switch (LTS) entries are observed. PR1209555
With MS-MIC and MS-MPC used for NAT service, when changing the source-address under a NAT rule term for a BASIC-NAT translation type, all future traffic hitting the NAT term is dropped. PR1257801
L2TP congestion window is set to 128 instead of 1 when tunnel is created. PR1265001
In an IPsec scenario, the kmd process might crash after configuring a certain IPsec configuration by apply-groups. PR1265404
Account Session ID, interface identifier, and subscriber user name trigger attributes are optimized for a scaled subscriber management environment. If you include any of the other, non-optimized, trigger attributes in a scaled subscriber management environment, a significant delay might be observed between the time when the DTCP ADD message is sent and the time when forwarding starts for the mirrored traffic. For example, if there are 10,000 subscriber sessions on the router, forwarding of the mirrored traffic might be delayed for 20 minutes. This delay occurs when you specify any non-optimized attribute, with or without any optimized attribute. The delay occurs regardless of the order of attributes in the DTCP packet. PR1269770
When a NAT pool is shared between port control protocol (PCP) and standard NAT, the PCP mappings cannot be manually cleared. PR1284261
Jl2tpd process restart should be avoided. GRES followed by jl2tpd process restart results in the loss of subscribers. PR1293783
If some subscribers log in without Tunnel-Client-Endpoint from RADIUS, each subscriber session gets its own Layer 2 Tunneling Protocol (L2TP). PR1293927
In an L2TP subscriber management scenario, the jl2tpd process might crash on the new master Routing Engine after GRES operation because of a rare timing issue. PR1295248
Telemetry script running on the router starts an ephemeral jl2tpd process. This results in running jl2tpd with a commit check. As ERA was getting initialized, this triggered creation of ERA log files. This was executed even for a commit check condition. The fix for this PR moves the file creation to the L2TP main process. PR1302270
In an L2TP scenario, when MX Series router works as L2TP Tunnel Switching (LTS), LTS clients experience packet drop for large packets. Data packet size ranged between MTU and 3 bytes smaller would be dropped. This is because LTS fragments these large packets and forwards the corrupted packets to the adjacent router. The adjacent router drops these packets because of L3 incompleteness or checksum error. PR1312691
When using the command show services l2tp tunnel extensive, the data Tx and data Rx values might decrease when subscriber sessions go down after running for an extended time. PR1318133
Stale Layer 2 tunneling protocol (L2TP) routes might be seen when L2TP peer uses any UDP port other than the default 1701. PR1322197
Aborting (using Ctrl+C) two commands by using the same management socket pointer, one after the other, might result in generating a core file. PR1337406
Software Installation and Upgrade
On a router running Junos OS software based on FreeBSD 10 and built before August 8th, 2017 with a Junos Selective Update (JSU) package, if the router is rebooted, the JSU package is not loaded properly. This means that the JSU package is no longer effective. As a result, the router is exposed to issues that are fixed in the JSU. PR1298935
Subscriber Access Management
On MX240, MX480, MX960, MX2010, and MX2020, jdiameterd might core if Tx control elements are pushed out of order by the device itself. PR1153776
In rare cases, in a subscriber environment, the authentication request might not cause authd to send the RADIUS REQUEST message to the RADIUS server. The log message Failed to queue the request, will be queued in authd internal queue might be observed. PR1178813
In a PPPoE subscribers scenario with a large scale of subscribers (for example, 3000), during operation of login and logout, some subscribers might be stuck in an error state of "Terminated". This issue impacts the traffic for these subscribers. PR1262219
Accounting messages are sent with the wrong timestamp to the RADIUS. PR1262892
In Junos OS Release 141X50, excluding tunnel attributes in access-request, accounting-start, and accounting-stop messages is allowed. In Junos OS Release 15.1TH and later, excluding tunnel attributes in access-request message is allowed (accounting-start and accounting-stop are already supported). PR1264024
Call rate performance might be impacted under heavy load if there are large numbers of small linked address pools because of a bug in the allocation traversal algorithm. PR1264052
The show network-access aaa statistics radius detail command can display an incorrect number of messages to the RADIUS server if the configured RADIUS servers are continuously flapping. PR1267307
DNS is not assigned through the access-profile if the authentication-order is set to none. PR1273034
In a scaled subscriber management scenario, bbe-smgd might spontaneously crash after it was restarted from CLI. PR1277099
After the Virtual Chassis switchover, RADIUS-assigned addresses that do not belong to any configured pool are added to the pool incorrectly. PR1286609
An authd process generates a core file while terminating a large number of subscribers. PR1289215
Service interim for DHCP subscriber is not working in JSRC. PR1303553
When a scaled number of subscribers log in, a memory leak might be seen while clearing subscribers with the Junos OS script or manually. PR1312517
Missing service interim occurs for random users in JSRC scenario. PR1315207
When address-assignment pool linking is configured, the IP addresses assignment might allocate IP addresses from later pools before the earlier pool is depleted. This is caused by the mechanism change for the IP assignment from the introduced release. PR1323829
User Interface and Configuration
The mgd would crash if a VLAN or IRB interface is included as part of interface-range configuration. PR1186156
A core file is generated by commitd when deletion for a certain configuration is committed. Configuration is properly changed after commit even though the core file remains. PR1267433
VPNs
In the MVPN environment, IGMP joins on the egress PE device, but PIM is not enabled on egress PE interfaces. Egress PE has interface(s) with static IGMP joins or IGMP receivers. IGMP is disabled on an interface, outgoing interfaces (OIFs) in MVPN forwarding routes are not updated, and hence traffic is forwarded on an interface that is not running IGMP or PIM. PR1157404
In l2circuit scenario when backup-neighbor is configured, the l2ckt process to reparses the PWs. While the PWs are in switchover state, rpd might crash. This is a timing issue caused by a race condition. Traffic loss and routing protocol peer restart might be seen during rpd crash. PR1182394
Under certain conditions, the PIM register-stop packet might be sent before the source tree join (Type-7) packet in a multicast virtual private network with Border Gateway Protocol (next-generation MVPN) scenario with only SPT mode configuration. This might cause some multicast packets to drop. PR1238916
The rpd memory leak is seen when next-generation MVPN type 6 and type 7 route adds, deletes, or changes occur. The leak is 36-byte block size on Junos OS Release 15.1 and prior releases, and 44-byte block size on Junos OS Release 15.1 and later releases. PR1259579
An rpd crash might be observed with a segmentation fault after applying an L2VPN configuration followed by the ping mpls l2vpn command. PR1272612
Memory leak occurs when PIM-MVPN is enabled for IPv4, and next-generation MVPN is not explicitly set to 'disable' for IPv6. As a result, rpd crashes because of the memory leak. PR1276041
When a Layer 2 circuit configured enabling NSR, an rpd crash might be observed on the backup Routing Engine when the Layer 2 circuit virtual-circuit-id is changed and committed. PR1345949
Resolved Issues: 15.1R6
Class of Service (CoS)
When the "chained-composite-next-hop" is enabled for Layer 3 VPN routes, MPLS CoS rewrite rules attached to the core-facing interface for "protocol mpls-inet-both-non-vpn" are applied not only to non-VPN traffic (which is the correct behavior) but also to Layer 3 VPN traffic. That is, both MPLS and IP headers in Layer 3 VPN traffic receive CoS rewrite. PR1062648
If the hidden command show class-of-service queue-consumption is executed many times (in this case, for 100 times), in a rare condition, the cosd process might crash with a core file generated. The core files can be seen by executing the CLI command show system core-dumps. PR1066009
The show interfaces queue <if_name> command has three display options: 1. show interfaces queue <if_name> Displays queued/transmitted/dropped packets/bytes for all IFD children. 2. show interfaces queue <if_name> aggregate Displays queued/transmitted/dropped packets/bytes for all IFD children except for IFD RTP traffic 3. show interfaces queue <if_name> remaining Displays queued/transmitted/dropped packets/bytes for IFD RTP traffic only. Note that unlike queued/transmitted/dropped counters, queues depth values cannot be aggregated. With changes done in this PR, the following is true for queues depth values: 1. show interfaces queue <if_name> Displays queues depth values for RTP queues. 2. show interfaces queue <if_name> aggregate Displays queues depth values for RTP queues. 3. show interfaces queue <if_name> remaining Displays queues depth values for RTP queues. The above logic is the same for physical interfaces, interface-sets, and logical interfaces units. PR1226558
On M Series, MX Series, and T Series routers with ingress and egress queueing enabled, input TCP is configured, but no output TCP on the logical interfaces. After you activate or deactivate CoS configuration, the cosd process might crash. PR1236866
The following error log message might be seen with Hierarchical CoS and strict-high scheduling configured. Dec 27 11:08:02.293 mand-re0 fpc1 cos_check_temporal_buffer_status: IFD ge-1/2/1 IFL 358: Delay buffer computation incorrect.^M If hierarchical scheduler is configured for a physical interface and if guaranteed rate is not set for a logical interface under this physical interface, then the temporal buffer is configured. The display of error message is valid when guaranteed rate is zero, but it is not valid when guaranteed rate is disabled. PR1238719
A round off issue that was leading to a difference in commit behavior of values such as 79m and 79.1m. PR1252505
Forwarding and Sampling
On all Junos OS platform, when the ifmon (that is, running the CLI command monitor interface <interface-name>) establishes a connection with the Packet Forwarding Engine process (pfed) and runs for a longer duration (as observed, the pfed has been running for more than 11 days), its multiple queries to the pfed might cause the pfed crash due to statistics counter wraparound. PR1151746
If a two-color policer is configured on MX Series with MPCs/MICs linecard, more traffic than the limited traffic might be passed when packets size is less than 128 bytes. PR1207810
Bandwidth-percent policer does not work on the ps interface, which will result in a commit error. PR1225977
Firewall filter family "any" with shared-bandwidth-policer on the MC-AE interface does not reconfigure bandwidth or carve up the policer when standby becomes active after A/S switchover; it drops all packets. PR1232607
With a sampling configuration, if you do not define a version for the second flow server, after committing the configuration, the backup Routing Engine might reboot. It might affect routing protocols replicating to the backup Right Engine. PR1233155
General Routing
Temp Sensor Fail alarm seen while ASMLC coming up. PR1036412
DPD/IKEv2 informational messages are dropped at the peer. Adding vendor ID in the INFORMATIONAL message is causing the peer to drop such packets. PR1066336
During IFL clean up "rtsock_peer_unconsumed_obj_add:object already deleted" log messages may indicate that the search bailed citing incorrect results. PR1085626
On Junos OS devices, if dot1x is configured, memory leak in kernel might occur that could lead to a system crash. PR1163782
In a very rare case, multiple Routing Engine switchovers might result in SNGPMB crash. The SNGPMB is the same thing as Switch Processor Mezzanine Board (SPMB). It is on the line card and contains the LCPU. It also manages locally discovered issues and the switch fabric via the chassis manager thread (CM), which communicates with the fabric manager thread (FM) in chassisd. PR1176094
If the MIC-3D-4XGE-XFP is used with MPC2E-3D-NG or MPC3E-3D-NG, the interfaces on the MIC-3D-4XGE-XFP connected to a DWDM device might flap continuously. PR1180890
When MS-MIC/MS-MPC is installed on MX, PIC card on MS-MIC/MS-MPC might crash in rare condition. This is a timing issue that might cause traffic loss and has no exact aspect of configurations for trigging that issue. Not as a workaround/restoration, please refer the external description step 3 for enabling dump under flow-control, which might arise more logs and help for engineer to diagnous that issue. PR1182807
On MX Series platforms, MS-MIC crash might occur. The exact trigger of the issue is unknown; normally, this issue might happen very rarely without any external triggers. The crash might occur with any services configuration, with core files pointing to Program terminated with signal 4, illegal instruction. PR1183828
On a Junos OS-based platform, CHASSISD_I2CS_READBACK_ERROR error might occur on a single occurrence of I2C read failure. These are transient errors. The errors might be seen randomly without any particular trigger. The fix is to suppress these messages. After the fix, these messages should be seen only when there are three consecutive I2C read failures. PR1187421
When VC-Heartbeat is configured, the MX Series Virtual Chassis split detection feature should cause the backup chassis to enter line card isolation mode, powering off all FPCs to force external gear to reroute traffic. A race condition in the mechanism can cause the backup chassis to also become protocol master, and leave its line cards in an operational state, which is undesirable. PR1187567
In rare cases, on MX240/MX480/MX960/MX2010/MX2020 Series platforms, MPC4 line card might never come back online after rebooting the chassis by request system reboot both-routing-engine command. PR1190418
Due to a bug in schema with Junos OS Release 14.1Rx and 15.1rx, administrators will not be able push MPLS configurations to devices that include loose strict tags. PR1193599
On an MPC5E, when the "chassis process" (chassisd) notices a high temperature condition on any sensor, it issues a high temperature alarm and increases fan speeds to high. PR1199447
An NPC core file might be seen during unified ISSU, and the unified ISSU might fail due to an ISSU ABORT error. PR1200690
When Path Computation Element Protocol (PCEP) is enabled and label-switched paths (LSPs) are undergoing changes, like a make-before-break (MBB) change for rerouting, the rpd has to send those updates to the Path Computation Element (PCE). However, when the PCEP session to the PCE goes down, these updates are cancelled, but the rpd fails to completely reclaim the memory allocated for these updates. This causes an increase in the rpd memory every time the connection to PCE goes down while LSPs are simultaneously going through MBB changes. This issue will be especially noticeable when connectivity to PCE goes UP and DOWN continuously. If the connection is in steady state, either UP or DOWN, then the memory leak will not happen. PR1206324
The l2ald might thrash when the targeted-broadcast is configured on EVPN irb. PR1206979
When using the show chassis hardware detail command in Junos OS 15.1 Release or later to display chassis components, the Compact Flash and Hard Disk serial numbers might be truncated to 15 characters. PR1209181
The BGP PIC Installs multiple MPLS LSP next hops as Active instead of Standby in Packet Forwarding Engine. This can cause a routing loop. PR1209907
When an ARP entry is learned through an AE interface and a route is pointing to that ARP nexthop, the ARP entry will not expire even if the ARP IP is not reachable. This issue occurs due to the route nexthop on the AE interface getting stuck in a unicast state even if the remote end is not reachable, and the RPD is unaware that the ARP is invalid. So, with this resolution, the route nexthop on the AE interface can be shown in the hold state when the remote end is not reachable. PR1211757
MS-MPC/MS-MIC might crash when large fragmented (larger than 2048 bytes) traffic goes through an ALG. PR1214134
Syslog message : fpc_pic_process_pic_power_off_config:xxxx :No FPC in slot y is displayed on empty FPC slots with no PIC power off configured by committing configuration change under chassis hierarchy.PR1216126
This issue happens only with RLT configuration and only on Junos OS 16.1 and later releases. PR1216991
Suspicious log messages like: vbf_ifl_bind_change_var_walker:363: ifl .pp.54615 (1073796438): FILTER (28) Bind change notify ran for 276701162891 us can be observed. PR1217975
Kernel crash and router reboot might happen when committing RLT configuration. PR1218326
On MX Series platforms, if you are replacing an MQ FPC (MPC Type1, 2, MPC 3D 16x10GE) with an XM one (MPC Type 3,4,5 6. 2E-NG, 3E-NG), all other MQ-based cards might report FI Cell underflow at the state stage. It Packets will be dropped. PR1219444
On MX Series platforms with enhanced subscriber management, performing a configuration commit that changes any dynamic profile data after the system has booted might result in login and logout connections per second (cps) performance degradation for subscribers using the dynamic profile. PR1220642
When fpc-pfe-liveness-check is configured, Packet Forwarding Engine liveness detection might incorrectly report a Packet Forwarding Engine failure event under a severe interface congestion situation. PR1220740
On MX Series platforms Virtual Chassis partial or complete traffic loss for streams via AE interfaces might be observed in certain scenarios. For example, if vcp ports were de-configured and re-configured again, then two consecutive global GRES switchovers were performed and the MPC hosting AE child links was reloaded, traffic loss would be observed after the MPC boots up due to incorrect programming of AE interface on its Packet Forwarding Engine. PR1220934
On MX Series with pppoe dynamic-profile and service-name-table xx configured, if configuring the prefix or any interface configuration and after committing, the output of show pppoe service-name-tables xx displays as Service Name Table not found: xx.PR1221278
In the dual Routing Engines scenario with scaled configurations, when events such as daemon restart or Routing Engine switchover occur, the ksyncd process or the backup Routing Engine might crash. This could impact the master Routing Engine in a scaled system because states between the master and the standby are synchronized after a crash. PR1221913
After Junos OS Release 15.1, the behavior of storage devices enumeration in kernel level has been changed. Device enumeration in legacy software prior to Junos OS Release 15.1 will show CF and Disk as ad0 and ad1, respectively. Device enumeration after Junos OS Release 15.1 will show CF and Disk as ad1 and ad0 instead in the result of show chassis hardware. This might be inconsistent for other results of output, such as show system boot-messages and show log messages. PR1222330
During change of authorization (CoA) requests, there are no changes in schedules. Requests are received successfully, but no changes are sent from the CoS side. PR1222553
Due to a defect related to autonegotiation in a Packet Forwarding Engine driver, making any configuration change to interface in MIC "3D 20x 1GE(LAN)-E,SFP" might lead to interface flapping. PR1222658
In an enhanced subscriber management environment (set system services subscriber-management enable) and when the remove-when-no-subscribers configuration statement is configured in auto-configure stanza, when the last subscriber logs out (which triggers the dynamic VLAN IFL removal) and immediately a new subscriber logs in before the IFL is set to inactive, dynamic profile deletion might fail. As a result, subsequent subscriber logins will also fail. PR1222829
The "unnumbered-address" under the dynamic profile shows the wrong value. PR1222975
The problem of tunnel stream getting misconfigured for LT interfaces was due to internal programming and has been corrected to evaluate multiple lT interfaces for FPC and PIC slot combination. PR1223087
In an MX Series platform Virtual Chassis with subscriber management environment, the bbe-smgd process might leak memory in the backup Routing Engine when running continuous subscriber login logout loop tests. It seems memory utilization increases with each login logout loop till it reaches 809 MB and it does not increase beyond that. PR1223625
In a PPPoE subscriber scenario, after the demultiplexer underlying interface AEx is changed to AEy, the source MAC used for PPPoE handshake is still the old AEx interface's MAC. This causes PPPoE clients to fail as the PADR packets from the client are dropped due to the MAC address mismatch. PR1224190
In a subscriber management environment log message "vbf_ifl_bind_change_var_walker:377: ifl .demux.22698 (1073764522): IFL TCP (38) Bind change notify ran for 1480 us" can often be seen. This log message is generated when time needed to complete execution of the routine exceeds 1ms, it is harmless and can be ignored. However, sometimes time calculation yields incorrect results and this issue has been corrected via this PR. PR1229967
The Routing Engine CPU used chassis temperature to decide fan speed. This PR has been fixed to use the real Routing Engine CPU temperature to decide the temperature threshold. PR1230109
On all platforms, for IPv6 static routes derived from weighted LSPs, unequal load balance does not work. PR1230186
The Random Load Balancing feature does not function; all traffic goes to one of the load-shared egress links instead of being shared across all the links. PR1230272
Due to a bug in Junos OS code, the interface statistics remain unchanged post-ISSU on the MXVC platform. This in turn leads to the RADIUS volume accounting value remaining unchanged post ISSU. This is a day-1 issue seen on the MXVC platform only after Junos OS Release 14.1.PR1230524
Unsuccessful DCE-RPC ALG sessions result in stale gates and lead to MS-MPC/MS-MIC restart when the gates clean up occurs after timeout. PR1230868
The dynamic-profile service filter matches the traffic that is not defined in the prefix-list applied to the filter. As a result, the filter does not work as expected or even match all the traffics. PR1230997
The ICMP identifier is not translated back to the expected value during traceroute for TTL exceeded packets on NAT using Multiservice MPC. This occurs for ICMP ID >255 and causes all hops (except first and last) appearing as "*". PR1231868
Input framing errors increment on interfaces connected to MPC2E-NG with 4x10G MIC when interface is configured in "wan-phy" mode. PR1232618
On the XQ-based linecard, in a rare condition, if offline/online the FPC or link flap, some error messages might be seen. PR1232686
High MPC5 CPU on a scaled setup with 64 - 128,000 subscribers due to XQ background service that collects internal statistics. PR1233452
When you set port-mirror to the MX Series router, LSP ping might fail and IP packets with options will not get mirrored due to the following unexpected echo reply from DUT: <----------------------------- echo request -----------------------------> echo reply [R1]------------[DUT]--------------[R2] A | -----------> echo reply (unexpected behavior) | mirror PR1234006
After the backup Routing Engine is replaced, the new backup Routing Engine cannot synchronize with the master Routing Engine if dynamic-profile-options versioning is configured. This is because the code checks if any dynamic profile is configured before enabling dynamic-profile-options versioning. If so, it throws a commit error. But there is no need to check when the Routing Engine is in backup state. PR1234453
KRT queue stuck happening because of socket buffer is sending some junk value to kernel and kernel is returning error 'EINVAL -- Bad parameter in request'. PR1234579
When non-Juniper SFP is used in MIC-3D-20GE-SFP-E or MIC-3D-20GE-SFP-EH MIC, the ISR 2 (MIC error interrupt) might be running off over 2.5 second due to unknown reason, and then the FPC host the MIC might be restarted and crashed. The fix add interrupt throttling for MIC interrupt and restarting the MIC if interrupts are more than the threshold (> 2500 per 5min). PR1235475
On MX Series platform, when per-packet load sharing is enabled under the aggregated Ethernet interface, egress traffic over the aggregated Ethernet interface might be dropped unexpectedly. PR1235866
When PIC-based MPLS J-Flow is configured and MPLS packets are being sampled at the egress (to be sent to the service PIC), the sampled packets do not reach service PIC which results in no MPLS J-Flow flows getting created. PR1236892
In an MX Series Virtual Chassis subscriber management environment, LI-enabled DHCP subscribers might experience packet drops because of MAC validation errors in the FPC. This issue was seen only when connecting the subscribers for the first time after rebooting the system. PR1237519
DNS server IP addresses are not present in the output of show subscribers extensive for DHCP subscribers when DNS configuration is provided from the access-profile or pool. When such data is provided from RADIUS, the output is correct. The issue is cosmetic: DNS addresses are provided to subscribers. PR1237525
Due to lack proper boundary checks in code, the MS-MPC might crash when receiving internally corrupted frames from other FPCs that have hardware failure or incorrect rewrite programming. PR1237667
Increased support of number of Routing Instances from 4K to 64K. PR1237854
MX Series platform is sending accounting interim without an update-interval configuration statement. PR1239273
Trace route will not resolve VRF loopback address where system integrator and pseudointerface exist. PR1240221
Subscriber Management: MIB ifJnxTable is not supported for subscriber interfaces. PR1240632
Session database synchronization might fail if the master Routing Engine or the master chassis in an MX Series Virtual-Chassis configuration (VC-M) is power cycled. PR1241162
In some cases, untagged bridged traffic might not be mirrored on the second port of the mirrored group. If untagged bridged traffic is to be mirrored/sent on two different interfaces of the mirrored group, traffic might be mirrored/sent only on one of the mirrored interfaces/ports. PR1241403
Routes learned over EBGP multipath peering might not get installed in the forwarding table, resulting in traffic being discarded for the affected destinations. This will only happen if in addition to EBGP multipath there is also a multihop configuration statement enabled for that peering and a unicast reverse path forwarding check is enabled over the involved interfaces. Corresponding routes would end up stuck in the KRT queue and related KRT log messages containing error code EINVAL -- Bad parameter in request would be seen in the logs. PR1241501
For MX Series Virtual Chassis, some VBF flows are missing after FPC restart. PR1244832
The power supply module (PSM) goes to Present State whenever there is a feed failure. The logic is changed to update the PSM state based on the number of feeds connected. PR1245459
Fragmented RPC packets can cause the MS-MPC and MS-MIC Service PICs to generate core files when using NAT with the RPC ALGs. PR1248397
The bbe-smgd generates a core file when duplicate UID variable names are used for different purposes in the configuration. PR1248725
Only one IA-NA dhcpv6 (without a prefix delegation (PD) request) could be bound in case two or more subscribers are provided with the same PD from RADIUS. For example, in case of several customer premises equipment (CPE) devices from a household, all sessions will be provided with the same ACI/ARI. If the username is formed based on ACI/ARI (so the username is the same for all sessions), RADIUS can provide the same PD for all sessions, this will allow only one session to be established even though the CPE devices did not request a PD. PR1249837
This issues occurs on MX960 routers with an MPC5E when queues associated with the L4 node get freed but the L4 node itself is not freed. When you try to free the L4 node, because the queues have already been deleted, you will receive a NULL queue code. The MPC crashes with the following message: qchip_disable_q_rates (q_chip=0x17931598, q_index=73016, q_node=0x0) at ../../../../../../src/pfe/common/drivers/queue-chip/qchip_rate.c:1801. PR1250335
smihelperd process can crash during subscriber logout process. PR1250760
When an IPv6 node receives an ICMPv6 PTB (Packet Too Big) message with MTU < 1280, the node will emit atomic fragments. This behavior might result in denial of service attack. PR1250832
On all Junos OS platforms that have rpd, if some interfaces go down, some peers will go down or BGP-RR(route-reflector) re-advertises routes and then the rpd (routing-protocol daemon) process might crash. PR1250978
On MX Series with MPC2E-NG/MPC3E-NG, the interfaces of these line cards might not come up when connecting to 3rd party transport switch. PR1254795
IRBs that are part of an L3 multicast group allocate ASIC memory when added to the group. A small amount of this memory is not freed when changes are made to the L3 multicast group. This could cause a crash because of an out-of-memory condition if there are continuous changes to multicast groups with IRBs over a long period.PR1255290
On some T Series routers, the LSI statistics are not shown in the aggregated Ethernet interface bundles and the input stats counter for the AE interface does not include MPLS traffic. PR1258003
MS-MPC/MS-MIC Service PIC constantly generates core files when NAT term calls application-set with no active applications: application-set EIM_ALG { inactive: application PS3C; inactive: application XBOX1; inactive: application XBOX2; inactive: application XBOX3; inactive: application XBOX4; inactive: application PS3D; inactive: application PS3E; inactive: application PS3F;}. PR1258060
Unable to run show subscribers extensive and some other CLI commands after GRES because subscriber-management database is unavailable. The other symptoms of the bug can be messages like sdb.db: close: Bad file descriptor and commit full. PR1258238
In a subscriber service environment, the device control daemon (DCD) might restart unexpectedly during commit process after changes to ATM interface configuration. PR1258744
It was observed that an authenticated dynamic VLAN interface is removed with an idle timeout if there are no subscribers on top and if "remove-when-no-subscribers" is configured at the auto-configure stanza. Such dynamic VLAN interface should be removed after its idle timeout expires and if it has stayed idle during this period. PR1262157
MX Series use incorrect routing table to send out the ICMP network unreachable message back to the source thus might cause some problem on the end user CPE. PR1263094
Dynamic VLAN interface is logged out after reaching idle timeout even though there is a client session (pppoe or dhcp) above it. The proper behavior is to keep the dynamic VLAN interface in case of a client session (pppoe or dhcp) is present above the dynamic VLAN interface. PR1263131
It is possible to see a bbe-smgd core under certain boundary conditions on the standby Routing Engine with certain specific configurations. Since the core is on the standby no disruption in service is expected and system recovers from this condition. PR1267646
High Availability (HA) and Resiliency
When nonstop routing (NSR) is configured in a group, and that group applied to routing options, NSR sometimes fails. To prevent NSR failure, configure the nonstop-routing statement directly at the [edit routing-instances routing-instance-name routing-options] hierarchy. PR1168818
On all platforms, when running unified ISSU, the connection might be broken between the master Routing Engine and the backup Routing Engine. PR1234196
In rare scenario, GRES might not reach ready state and fail to start, due to the fact that Routing Engine does not receive state ack message from Packet Forwarding Engine after performing GRES. This is a timing issue and hard to reproduce. It might also stop Routing Engine resource releasing and then cause resource exhausting. There is no effective method for restoration and the only way is to reboot the system. PR1236882
Infrastructure
In an RSVP scenario, when you provision RSVP LSP with ldp-tunneling enabled and these LSPs are configured with link protection, continuous kernel logs and an LDP statistics timeout error might be seen when executing show ldp traffic-statistics. PR1215452
Polling SNMP QoS queue statistics along with physical interface statistics might result in flat values for QoS queue statistics. The flat values could give a false impression that spikes are happening in the queues. PR1226781
On all Junos OS platforms and on the router with PIM enabled that has a local receiver, stale next hops are present because they did not get deleted by daemons due to a timing issue. PR1250880
Interfaces and Chassis
In rare conditions, FPC might crash when CLI command request chassis mic offline fpc-slot <fpc-slot> mic-slot <mic-slot> or request chassis pic offline fpc-slot <fpc-slot> pic-slot <pic-slot> is executed. This is due to a software defect in which SFP diagnostics polling function tries to access already destroyed SFP data structure by MIC/PIC offline. PR1204485
The dcd cannot start after router reboot due to nonexisting logical interfaces referenced in demux-options underlying-interface. PR1216811
In Junos OS Release 14.2 and later, if asymmetric-hold-time, delegate-processing, and preempt hold-time are configured, when the neighbor's interface comes up again, the asymmetric-hold-time feature cannot be used as expected. PR1219757
Previously the same IP address could be configured on different logical interfaces from different physical interfaces but in the same routing instance. Only one logical interface was assigned with the identical address after commit. Such behavior could cause confusion: there was no warning during the commit, only syslog messages indicating incorrect configuration. With the fix, it is not allowed to configure the same IP address (the length of the mask does not matter). PR1221993
The configuration change where for a static VLAN demux interface the underlying physical interface is changed to one with a lower bandwidth (for example, from xe to ge) can fail with the following error: error: Bandwidth on IFL demux0.7000 cannot be greater than that of its IFD. PR1232598
On an MX Series platform acting as a broadband network gateway (BNG), in Point-to-Point Protocol (PPP) scenario the router can send LCP Terminate-Ack packet after the PPP over Ethernet (PPPoE) PPPoE Active Discovery Terminate (PADT) packet. This behavior does not follow RFC 2516, which explicitly demands that when a PADT is sent, no further PPP traffic is allowed to be sent using that session, including normal PPP termination packets. PR1234027
On M7i and M10i devices, jnxOperatingState shows 1 (unknown) for Fan Tray 1. user@router> show snmp mib walk jnxOperatingDescr | match 4.2 jnxOperatingDescr.4.2.0.0 = Fan Tray 1 jnxOperatingDescr.4.2.1.0 = Fan Tray 1 Fan 1 jnxOperatingDescr.4.2.2.0 = Fan Tray 1 Fan 2 jnxOperatingDescr.4.2.3.0 = Fan Tray 1 Fan 3 jnxOperatingDescr.4.2.4.0 = Fan Tray 1 Fan 4 jnxOperatingDescr.4.2.5.0 = Fan Tray 1 Fan 5 jnxOperatingDescr.4.2.6.0 = Fan Tray 1 Fan 6 jnxOperatingDescr.4.2.7.0 = Fan Tray 1 Fan 7 jnxOperatingDescr.4.2.8.0 = Fan Tray 1 Fan 8 user@router> show snmp mib walk jnxOperatingState | match 4.2 jnxOperatingState.4.2.0.0 = 1 jnxOperatingState.4.2.1.0 = 2 jnxOperatingState.4.2.2.0 = 2 jnxOperatingState.4.2.3.0 = 2 jnxOperatingState.4.2.4.0 = 2 jnxOperatingState.4.2.5.0 = 2 jnxOperatingState.4.2.6.0 = 2 jnxOperatingState.4.2.7.0 = 2 jnxOperatingState.4.2.8.0 = 2 .PR1237255
If the maximum transmission units (MTUs) on BNG and CPE sides have different values, the MX Series router might calculate the MTU value for the corresponding pp0 logical interface incorrectly. PR1240257
If more than one logical interface (IFL) is configured under the same physical interface (IFD), and VRRP is configured on one IFL without VLAN and the lower unit number IFL has VLAN configuration, VRRP incorrectly carries the VLAN information from the lower unit number IFL to this logical interface configuration. As a result, VRRP might get stuck (state: unknown, VR State: bringup). This might happen if VRRP is configured on the physical interface with flexible-vlan-tagging or the lt interface without flexible-vlan-tagging. PR1247050
When using static demux VLAN interfaces the Link Local address will not be synced between kernal and subscriber management demon. When using router advertisement on static VLAN Demux interaface and not in IP dynamic profile, a Router Solicit from customer equipment might not be answered by the MX Series. This is dependant on which address the CPE is using. In this PR, the option to configure the MX Series to use EUI-64 address for the demux VLAN, will ensure that the addresses are synchronized between the demons. PR1250313
On Junos OS platforms, the cfmd process runs by default. When bridge-domain is configured, if you commit a configuration related to physical interface/Logical interface (IFD/IFL), cfmd memory leak might occur due to a software defect. As a result, the memory leak can cause cfmd to crash. PR1255584
MIC-3D-20GE-SFP-E or MIC-3D-20GE-SFP when reading out periodically SFP diagnostic information, due to misbehaving SFP or noise on the I2C BUS ,SFP thread might be hogging and watchdog check will restart the MPC to recover. Enhancements of such error handling will prevent the SFP thread hogging and MPC restart. PR1260517
Layer 2 Ethernet Services
When GRES is enabled, after Routing Engine switchover, the local MAC address is not learned anymore from local the CE router in the VPLS instance because of spanning-tree "discarding" in the kernel table. PR1205373
Problems with IPv4 HTTP traffic forwarding for dual stacked PPPoE client occur after upgrade from Junos OS Release 14.1X50 to 15.1R4. In this scenario, the user requested two addresses in the DHCPv6 Solicit, an IA_NA and an IA_PD. The server was configured to respond with an IA_PD from a local address pool. The IA_NA was assigned with RA and no address pool for IA_NA was configured at the server. Per RFC, The status codes returned in DHCPv6 Advertise/Reply PDUs from the server when an IA_NA address could not be assigned, should be NO_ADDRS_AVAIL. This was the behavior in Junos OS Release 14.1x5-D150. However, a regression caused this status code to be changed to NO_BINDING instead of NO_ADDRS_AVAIL, in Junos OS 15.1 Release. The CPE in question was likely not interpreting the NO_BINDING status code as a failure from the server to assign an IA_NA address. The status code should respond with Advertise/Reply PDUs with the IA_NA status code of NO_ADDRS_AVAIL. PR1224212
During a unified ISSU process, if the first unified ISSU is aborted for some reason, an internal timer will not be cleaned up, and the new lacpd will be forked up. This causes the second unified ISSU in the backup Routing Engine to be aborted in the daemon prepare phase. It will not proceed further. PR1225523
MX Series platforms do not include Delegated-IPv6-Prefix in accounting interim. PR1231665
This issue can be seen if CPE is intiating DHCPv6-Solicit with IA_NA, IA-PD and Rapid-Commit Option but the MX Series router sends the DHCV6 Advertise with Rapid commit flag even though Rapid-Commit statement is not enabled on the MX Series. PR1235578
When DPC cards are used and the set chassis fpc-pfe-liveness-check configuration statement is configured, some alarms can be seen on the DPC cards (/var partition is full) during upgrading from Junos OS Release 15.1F2 - S12 to Junos OS Release 15.1F2 - S13. When trying to downgrade to 15.1F2-S12 the alarm is cleared, and when upgrading to 15.1F2-S13 the alarm is seen again. PR1237218
When LACP is configured in fast periodic along with the fast-hello-issu configuration statement, LACP might time out if there is any interface commit operation on the peer router during unified ISSU, which causes OSPF adjacency flapping. PR1240679
DHCP-Relay option-82 format changes. PR1253205
MPLS
On the P2MP LSP transit router with link-pr otection enabled, if the LSP is the last subLSP, tearing the last subLSP (for example, a RESV tear message is received from downstream router) might crash the routing process (rpd).PR1036452
When you have statically configured ingress and transit LSPs, because of a timing issue, the selfID used by the transit LSP might get allocated to the ingress LSP. Ingress static LSPs do not reuse the same selfID across restarts, whereas transit static LSPs try to reuse the selfID. This situation leads to an RPD crash caused by the collision that occurred when the transit LSP tried to reuse the same ID. PR1084736
You can configure both load-balance-label-capability and no-load-balance-label-capabilitytogether. This is incorrect and confusing. PR1126439
Log messages like /kernel: %KERN-3: tag_nh_iff_record_delete_iff:404 are cosmetic and were switched on in another PR by mistake. PR1171947
When using RSVP-TE protocol to establish LSPs, a make before break (MBB) might not quit and start again when there is a failure on PSB2 (RSVP Path State Block for new LSP) in some cases where PathErr is not seen. For example, for a PSB2 that is already up and there is PathErr processing for it in place already, in this case, no PathErr is seen, owing to local-reversion and a quick flap. As a result, no rerouting happens even if the TE metric cost is raised. This issue has more chances of occurring only when there is non-default optimize switchover delay. PR1205996
When dynamic-tunnel is configured but RSVP signaling is disabled, any configuration that affects dynamic-tunnels could cause the rpd process to crash. PR1213431
In a scaled environment, when there are many Unicast NHs that are related to the same transport LSP (for example, the same RSVP or LDP label), MPLS traffic statistics collection might take too much CPU time in kernel mode. This can in turn lead to various system impacting events, like scheduler slips of various processes and losing connection toward the backup Routing Engine and FPCs. PR1214961
On MX104 Universal Routing Platforms operating with Layer 3 VPN and configured to allow chained composite next hops for devices handling ingress or transit traffic in the network, packets might not be forwarded after they pass through the generic routing encapsulation (GRE) tunnel. This issue is observed on routers operating with Layer 3 VPN that also include the statement chained-composite-next-hop ingress at the [edit routing-options forwarding-table] hierarchy level. When configured in this manner, the Packet Forwarding Engine cannot push VPN labels for packets. As a result, packets arriving at the next-hop destination cannot be forwarded. PR1215382
If the link/node failure that triggered a bypass persists for a long time, and there are LSPs that do not get globally repaired, multiple stale LSP entries are showing down and listed multiple times in the MPLS LSP. PR1222179
In a VPLS environment, if you delete the routing instance, in a rare condition, the rpd process might crash, the routing protocols are impacted, and traffic disruption will be seen due to loss of routing information. This is a timing issue and hard to reproduce. PR1223514
In impacted Junos OS releases ldp will import metric for all IS-IS routes that have tags without the configuration statement track-igp-metric. Junos OS Release 14.1R3 ,14.2R1, and later are impacted by this issue . PR1225592
Under certain conditions, the entropy label value being generated might not provide a good load-sharing result. PR1235258
On MX Series, the rpd might crash when the RSVP bypass undergoes re-optimization and the re-optimized instance encounters failure before it becomes the main instance. The core files could be seen by executing the CLI command show system core-dumps. Stack trace: #0 0x0000000802ad8bd4 in patricia_node_in_tree () #0 0x0000000802ad8bd4 in patricia_node_in_tree () #1 0x00000000009ec3da in tag_pvc_shortwait () #2 0x0000000000a2fe94 in ted_delete_cc_from_link () #3 0x0000000000a3009d in ted2cspf_cleanup () #4 0x0000000000f27d56 in task_job_create_foreground () #5 0x0000000000f289e5 in task_job_bg_dispatch () #6 0x0000000000f24d85 in task_scheduler () #7 0x000000000062b9e2 in main (). PR1250253
With non-stop-routing (NSR) and LDP protocol running, a routing protocol process (RPD) on the backup Routing Engine might consume excessive CPU time if it cannot connect to the RPD on the master Routing Engine. PR1250941
Multicast
RPD creates an indirect next hop when a multicast route (S,G) needs to be installed when listeners show their interest to S,G traffic. The kernel would then create a composite NH. In this case, this appears to be a P2MP MCNH that gets created. When any member interface is not a Packet Forwarding Engine specific interface (for example, Vt, LSI, IRB, or any other pseudointerfaces), the kernel throws a message indicating that FMBB cannot be supported. These messages are harmless and do not have any impact. PR1230465
Network Management and Monitoring
On Junos OS Releases prior to 15.1R6 and 16.1R4, Digital Optical Monitoring (DOM) MIB jnxDomCurrentTable for 1G SFP interface does not return any value. PR1218134
Platform and Infrastructure
In a multicast environment, memory leak might be seen on MX Series with MPCs/MICs-based platform after adding, deleting, and changing multicast groups repeatedly. PR1160909
If you configure micro BFD on an aggregate interface, using native-vlan and if native-vlan is configured on one of the logical interfaces, then ARP resolution fails for that logical interface. PR1172229
On Junos OS platforms with configuration statement delta-export enabled, the delta-export database might not get correctly reinitialized upon one of the following conditions: 1. delta-export is enabled for first time (delta-export is enabled in just this commit). 2. load override (delta-export is enabled in the configuration). 3. commit full (delta-export is enabled in the configuration). Because of these conditions, there is a mismatch in databases in further commits. As a result, the configuration on the backup Routing Engine will be corrupted. PR1199895
When you check default configurations about groups junos-defaults, there is no information shown. PR1201380
Blank firewall logs for IPv6 packets with next-header. PR1201864
With 64-bit rpd, if BGP is applied to an export policy with "from protocol", it might cause an error to filter some routes that are not matched with the value from "from protocol". PR1206511
On MX Series platforms installed with both DPC/E and MX Series-based MPC, when DPC/E detects a remote destination error toward a MX Series-based MPC Packet Forwarding Engine, unexpected fabric drops happened. PR1214461
In large-scale configurations or environment with high rates of churn, the FPC ASIC memory will become "fragmented" over time. It is possible in an extreme case that memory of a particular size will become exhausted and due to the fragmentation, the available memory will not fulfill the pending allocation. PR1216300
MX Series with MPCs/MICs-based linecards might crash after firewall filter configuration change is committed. PR1220185
When any MPC line card is offlined, it goes offline via all offline flows and connection is cleaned, but in the end of the offline flow, somehow it delays powering off the line card. The chasd powers off the MPC via I2cs that write the respective power registers, but the hardware is not really powering off. As a consequence, since MPC is still powered on but the connection is down, it will try to reconnect, then start to come up automatically within 10 seconds. This issue does not occur all the time. PR1222071
Whenever any event (configuration change, login, logout) happens in the system that has to be logged in the accounting server, auditd will be notified about that event. The daemon that notifies auditd about the event writes the event message to a socket and auditd will read from the socket. After reading the message from the socket, auditd will process the event and send the message to the accounting server for logging. A crash occurs when the auditd reads the event message from the socket. Error that is returned while reading is EAGAIN, which means Resource temporarily unavailable. It means currently there is no data to read. When read operation fails, the process is aborted and a core file is generated. PR1222493
NTP peers failed to synchronize in symmetric active mode when there is significant downtime of one peer (for example, due to power maintenance or hardware or software upgrades). PR1222544
This is a race condition between database creation and database access. It is rarely reproducible. There is no functional impact of the core. PR1225086
MAC entry aging is not updated with Source MAC refresh on MPC4E card at slow traffic rate. PR1230516
Firewall filter index mapping becomes incorrect after Routing Engine switchover, because the contents of
/var/etc/filters/filter-define.conf
are incorrectly changed after Routing Engine switchover. PR1230954In AI-Scripts (Advanced Insight Scripts) environment, when some special combination of jcs:printf(...) and some special characters (such as \n \t \\) at the boundary of the buffer, the scripts process might crash and high RPD memory usage is observed. PR1232418
The scale-subscriber license count might increase to an invalid license state with L2TP/LTS clients. This is due to the l2tpd daemon not going through proper state transition on L2TP/LTS clients logout. Hence, license count was not getting updated. The fix will ensure license count is updated on logout regardless of whether the daemon goes through proper state transition or not. PR1233298
The increase in CPU utilization on the FPCs and MPCs might periodically go as high as 100% as a result of the microcode re-balancing mechanism. UCODE Re-balancing involves instrumenting instructions within the Micro Kernel to gather data. While the PPE is running the UKERN thread in a tight thread_yield() loop waiting for a timer to expire, the UKERN scheduler reports the CPU as being 100% utilized. Replaced the tight thread_yield() loop with a timed_semaphore. Now the semaphore marks the UKERN thread as blocked, the scheduler does not report 100% CPU utilization. PR1233390
Login for flow-tap DTCP-over-SSH service fails when SSH key-based authentication is configured for the flow-tap user. PR1234464
MX2010/2020 cannot sample multicast traffic when this multicast is copied to multiple interfaces. This behavior is specific to MX2010/MX2020 with MS-MIC. PR1237164
FPC and Routing Engine might stuck in high CPU when DDoS SCFD is turned on. PR1237486
Starting in Junos OS Release 13.3, the SRX Series cluster needs to run auditd on both nodes. But on MX Series Virtual Chassis Bm and TXP all LCCs also add auditd. Because LCC and VC-BM do not have a route for an accounting server, the following error is generated: -------------------------------------------------------------------------- 2565 root 1 96 0 3304K 2620K RUN 0:01 0.00% auditd lcc0-re0: -------------------------------------------------------------------------- 2398 root 1 96 0 3240K 2536K select 0:01 0.00% auditd lcc1-re0: -------------------------------------------------------------------------- 2791 root 1 96 0 3244K 2544K select 0:01 0.00% auditd %DAEMON-3: auditd[2398]: sendmsg to 10.233.225.78(10.233.225.78).1813 failed: Network is down %DAEMON-3: auditd[2398]: AUDITD_RADIUS_REQ_SEND_ERROR: auditd_rad_send: sendto/sendmsg: Network is down. PR1238002
Due to a regression issue, the presence of errors or traps during unified ISSU might result in LU/XL-based FPC crash. PR1239304
An FPC crash or only traffic loss might be seen on MPC1E/2E/3E/4E or MPC-3D-16XGE-SFPP during unified ISSU. This issue occurs because counter memory might get corrupted during unified ISSU. It is a timing issue. PR1241729
Auditd might crash when RADIUS servers are not reachable and when there are multiple times of Routing Engine switchover. When we try to send RADIUS requests to non-reachable RADIUS servers, we try for maximum number of times. After the maximum number of tries is reached, we close the socket used to send RADIUS requests. After the socket is closed, we are trying to dispatch next message resulting in crash. Auditd will get restarted automatically after it is crashed, so that RADIUS messages if any present in the queue at the time of crash will be lost. After auditd gets restarted, the next event that has to be sent to RADIUS server, will be sent normally. PR1250525
On a router with MPC5Es or MPC6Es, if VPLS or bridging features are configured, it is possible that unicast L2 packets with known MAC addresses are flooded instead of being forwarded to the known ports. PR1255073
Routing Policy and Firewall Filters
With rib-groups configured for importing routing information to multiple routing tables, unexpected route refresh might happen when committing configuration change. This issue occurs because of a defect in code related to secondary table list handling. PR1201644
Routing Protocols
For devices populated with master and backup Routing Engines and configured for nonstop active routing (NSR) and Protocol Independent Multicast (PIM) configuration, the routing protocol process (RPD) might crash on the backup Routing Engine due to a memory leak. This leak occurs when the backup Routing Engine handling mirror updates about PIM received from the master Routing Engine deletes information about a PIM session from its database. But due to a software defect, a leak of 2 memory blocks (8 or 16 bytes) might occur for every PIM leave. If the memory is exhausted, the rpd might crash on the backup Routing Engine. There is no impact seen on the master Routing Engine when the rpd crashes on the backup Routing Engine. Use the show system processes extensive command to check the memory. PR1155778
After Routing Engine switchover, a race condition could result in a RIB not registering for route flash. As a result, there might be stale entries seen when routes are withdrawn. This is a rare condition. PR1170572
When you have LSPs as OSPF neighbors and run run show snmp mib walk decimal 1.3.6.1.2.1.14 you get the message request failed, and the command does not complete. PR1177315
In a BGP scenario with inet-mdt family configured under protocols BGP, route table <NAME>.mdt.0 might get deleted if it has no routes. As a result, rpd might crash on the backup Routing Engine, and BGP sessions might flap on master Routing Engine. PR1207988
When changing the route distinguisher (RD) for an existing routing instance with established MSDP sessions or deleting or deactivating an MSDP session in the configuration, the rpd process might crash, which leads to traffic disruption. PR1216078
rLFA OSPF protection path/next hop keeps flapping every 4-5 seconds when metric order policy is configured. PR1220343
When the first multicast packet gets fragmented because of its large size, the receiver in an MVPN scenario does not receive all fragments. The fix of this PR will ensure that the software waits until the last fragment of the PIM register packet is received at the rendezvous point (RP) before processing the PIM resolve request. After the last fragment of register packet is received, PIM register state is created and PIM resolve request is triggered to install the multicast route. Hence, all fragments of the register packet will get forwarded to the receiver. PR1229398
Junos OS Release 15.1 and later might be impacted by the receipt of a crafted BGP UPDATE which can lead to an routing process daemon (rpd) crash and restart. Repeated crashes of the rpd daemon can result in an extended denial of service condition. Refer to JSA10778 for more information. PR1229868
Remote LFA protection might not work for the OSPF route when there is no ECMP to act as a candidate for the PQ node (the PQ node's router ID belongs to a different area).PR1230322
In a rare condition after BGP session flaps, BGP updates might not be sent completely, resulting in BGP routes shown in the advertising-protocol table in the local end but not shown in the receive-protocol table in the remote end. PR1231707
The routing protocol process (rpd) sometimes is interrupted and halted when it tries to free a session reference block. This can occur when the memory redzone check fails when attempting to free reference memory block. The fail is caused when the redzone check receives an address that is not the beginning of a memory block. PR1232742
When MX Series router is running protocol BGP, and policy configuration is modified, an assertion condition might be hit where the routing protocol process (rpd) generates a core file. PR1239990
Session uptime in show bfd session detail output omits seconds if uptime is longer than 24 hours, which is different from similar output for Label Distribution Protocol (LDP), Open Shortest Path First (OSPF), or Border Gateway Protocol (BGP). Seconds are always included in corresponding outputs for these protocols. PR1245105
On all platforms, if multi-area rLFA along with policy is configured, Open Shortest Path First (OSPF) nexthop might keep flapping. PR1248746
Services Applications
In an L2TP scenario, when the LNS is flooded by high rate L2TP messages from LAC, the CPU on the Routing Engine might become too busy to bring up new sessions. PR990081
IDP policy is trashing with following log messages: Aug 23 20:56:25 esst480a jlaunchd: idp-policy (PID 8562) started Aug 23 20:56:25 esst480a jlaunchd: idp-policy (PID 8562) exited with status=0 Normal Exit Aug 23 20:56:25 esst480a jlaunchd: idp-policy (PID 8564) started Aug 23 20:56:30 esst480a jlaunchd: idp-policy (PID 8564) exited with status=0 Normal Exit Aug 23 20:56:30 esst480a jlaunchd: idp-policy (PID 8570) started Aug 23 20:56:35 esst480a jlaunchd: idp-policy (PID 8570) exited with status=0 Normal Exit Aug 23 20:56:35 esst480a jlaunchd: idp-policy (PID 8574) started Aug 23 20:56:40 esst480a jlaunchd: idp-policy (PID 8574) exited with status=0 Normal Exit. On MX Series platforms, running IDP process is stopped from Junos OS Release 14.2 and later. So idp-policy configuration does not work. This is an expected behavior. Starting with the fixed versions we have completely deprecated the IDP related CLIs on MX Series platforms and we should not see any IDP related messages. PR1209351
The kmd process might monopolize the CPU when continuous polling for IKE-related data through SNMP. This issue is specific to IKE-related SNMP polling and not seen when continuously polling IPsec-related data through SNMP. PR1209406
In case of massive flapping of subscribers on M120 platform, a memory leak on IQ2E PIC can happen and it can result in the inability to attach a configured CoS policer to the newly connected l2tp subscriber. PR1210976
When loading or rolling back a configuration that removes a service-set and changes where the MS interfaces are assigned, traffic may be blackholed to a series of the existing service-set might be dropped. PR1223302
When the stateful firewall flows time out repeatedly, there can be performance degradation on the MS-DPC PIC. This will eventually lead to the MS-DPC being unable to scale to the peak flows that are allowed. PR1242556
On Layer 2 Tunneling Protocol (L2TP) network server (LNS) router L2TP tunnels might be stuck in "Terminating" state after execution of particular sequence of CLI commands. Deactivation of tunnel-group on LNS leads to cleanup of all logged-in L2TP subscribers and L2TP tunnels. If the clear services l2tp tunnel command is issued when the cleanup has not been completed, it is possible that the tunnel will not be cleaned up properly and will get stuck in "Terminating" state. PR1249768
Subscriber Access Management
In a subscriber management environment with two or more RADIUS servers connected to an MX Series router, syslog is not generated when radius server is mark dead. PR1207904
On MX Series routers with dual Routing Engines, after router GRES, if you add a traceoptions filter before GRES is fully completed, the authd process might crash. PR1234395
The command show network-access aaa statistics radius detail can display incorrect number of messages to RADIUS server in case configured RADIUS server's are continuously flapping. PR1267307
User Interface and Configuration
This issue is specific to a router running a Junos OS Release up to 15.1R<x>, which also has authentication-key-chains configured. When the secret for a key is not configured, commit fails with the message error: configuration check-out failed: daemon file propagation failed. This issue is not applicable to Junos OS Release 15.1F, 16.1, and later. PR1213165
Some configuration objects are not properly handled by "delta-export" (dexp). This leads to an omission of the section of the configuration. PR1245187
VPNs
For NG-MVPN, the traffic threshold is ignored if it is configured in a configuration group, then applied to an MVPN instance. If the traffic threshold is configured directly under the MVPN instance, the issue is not seen. PR1191002
On Junos OS platforms, only VPLS supports automatic-site-id. Configuring automatic-site-id under the L2VPN instance could cause a rpd core file. The fix has now been provided to add a commit check to disallow configuring automatic-site-id under a L2VPN instance. With this fix, a commit error will be generated if you attempt to configure automatic-site-id under an L2VPN instance. PR1214328
The routing protocol process (rpd) might eventually become exhausted and crash when Layer 2 Circuit, Layer 2 VPN, or virtual private LAN service (VPLS) configurations are committed. These commit activities might create a small memory leak of 84 bytes in the rpd. PR1220363
In an NG-MVPN scenario with the asm-override-ssm configuration statement for source-specific multicast (SSM) group, if you issue the clear pim join command on the source PE router, downstream interfaces get pruned, causing the multicast flow to stop. PR1232623
On M Series and MX Series platforms, the L2circuit does not switch from primary to backup and vice versa based on the APS status change, because when APS switchover happens, the PW switchover does not switch to the new APS active neighbor. PR1239381
With NSR enabled and a Layer 2 circuit configured, an rpd crash might be observed on the backup Routing Engine when you change the Layer 2 circuit neighbor and then commit the changes. PR1241801
Resolved Issues: 15.1R5
Class of Service (CoS)
In rare cases, after polling "show snmp mib walk jnxCosQstatTxedBytes", cosd coredump might occur due to memory corruption on Junos platform with COS enabled. PR1199687
The actual problem seen is Logical Interfaces binded to Routing-instance classifier is not seen under classifier Index inside CFEB. The cause for this Issue was "missing else statement was leading to data getting overwritten for LSI scenario”. The same has been Corrected. PR1200785
Forwarding and Sampling
The dfwc (daemon that performs as firewall compiler) might fail to get filter information from the kernel in COMMIT_CHECK (config validation) mode. As a result, the filter index is regenerated starting from index 1. This will create the mismatch of filter index as compared to the existing filters in the system. The fix provided will identify and recover the issue. PR1107139
Commit gives error as follows when apply-groups is configured under bridge domain. error: Check-out failed for Firewall process (/usr/sbin/dfwd) without details. PR1166537
SRRD(Sampling Route-Record Daemon) process doesn't delete routes when the DELETE is received from RPD in few configuration cases. This results in build-up of memory in SRRD daemon and once SRRD reaches the limit, it crashes and restarts itself. This happens only when one certain family is not configured on all of the FPC clients (e.g., FPC with inline J-Flow enabled or PIC with PIC-based sampling enabled is one client). For example, only IPv4 family is configured in all the clients, and IPv6 and MPLS families are not configured for sampling in any of the clients. PR1180158
The changes to srrd (sampling route reflector daemon - new architecture for sampling) process between 14.2R5.8 and 14.2R6.5 severely reduce MX80 series available memory and therefore RIB/FIB scaling. PR1187721
Starting with Junos Release 14.2R1, FPC offline could trigger Sampling Route Record (SRRD) daemon restart. PR1191010
On MX platform with "Enhanced Subscriber Management" mode, if default forwarding-classes are referenced by subscriber filters, commit configuration changes after GRES will be failed. PR1214040
General Routing
In the scenario when one interfaces having same IP addresses with a RSVP strict path en-routed interface IP address (for example, subscribed interface borrows the loopback interface IP address scenario, or where one of PE-CE interface inside a VPN instance has the same IP address of the routerÂs uplink RSVP interface in master instance), RSVP-TE would send PathErr to ingress router due to matching to wrong interface which is not RSVP interface but having same IP address with the RSVP interface when checking the explicit route object (ERO).PR 1031513
On dual Routing Engine platform with GRES and NSR enabled, after Routing Engine switchover, the rpd might crash when trying to destroy a CNH NH (composite next hop, for example, it would be created in PIM, L3VPN, MVPN scenario and so on) with valid reference on it. It is because that during switchover (while backup rpd switches to master), there is a transition period where rpd switched to master mode but KRT is still in backup mode. If KRT (still in backup mode) receives a CNH addition followed by Route additions using this CNH during this phase, it would result in CNH in KRT with valid route references yet on expiry queue. It is hard to reproduce, in this case, it occurs after Routing Engine switchovers consecutively at two times. PR1086019
The configuration support for enabling ingress and egress layer2-overhead is available in dynamic-profile but the functionality is not supported in 15.1R3 and 15.1R4. For example, set interfaces ge-4/2/9 unit 0 account-layer2-overhead ingress 30 set interfaces ge-4/2/9 unit 0 account-layer2-overhead egress 30 With the above configuration, the number of layer2-overhead bytes (30) are not added to the input bytes in traffic statistics. PR1096323
During NSR Routing Engine switchover, there might be a control plane black window for inline BFD causing the BFD session to flap. This is a day-1 issue, and tuning the retrans timer would solve the problem. But since these timers have to meet RFC compliance, we cannot really do that. Today we have the retrans timer as 1000 milliseconds. The workaround would be to configure a higher retrans timer value. PR1105980
The rpd fails to respond any new CLI routing commands (for example, show mpls lsp terse). Rpd is forking a child process while rpd is processing a show command. When the subprocess tried to exit, it tried to close the management socket being used by the show command. This failure might cause rpd subprocess to crash and generate a core file. It also removes the rpd pid file which prevent rpd from processing any new CLI commands even though original rpd process continues to run normally. PR1111526
During initial ramp up of an IPSec session, a race condition might cause the mspmand process crash in rare circumstances. PR1116487
On MX Series platform with MS-MPC/MS-MIC in use, due to some reason if the NAT session is freed/removed but without removing timer wheel entry, then it might cause MS-MPC/MS-MIC crash. It is a timing issue where just before invoking the timer wheel callback the NAT session extension got freed/removed. PR1117662
On MX Series platforms, the MS-MPC crash might occur. The exact trigger of the issue is unknown; normally, this issue might happen over long hours (e.g, within a week) of traffic run (e.g, running HTTP/HTTPS/DNS/RTSP/TFP/FTP traffic profile). Coredumps might point to - Program terminated with signal 4, Illegal instruction PR1124466
The jsscd might crash in static-subscribers scaling environment (e.g. 112K total subscribers, 77K dhcp subscribers, 3K static-subscribers, 32K dynamic vlans), when this issue occurs the subscribers might be lost. abc@abc_RE0> show system core-dumps -rw-rw---- 1 root field 8088852 Jan 1 11:11 /var/tmp/jsscd.core-tarball.0.tgz PR1133780
In a multicast virtual private network (MVPN) scenario during route churn, the rpd process might crash due to inconsistency multicast next-hop between rpd and kernel. PR1138366
On MX Series platforms with FPC3, the octets of IPv4 source and destination addresses in the firewall log are listed reverse; this might affect troubleshooting. The IPv6 log works fine. This is a minor issue, there is no other service impact. PR1141495
During route flaps such as (interface flaps or network instability) the Packet Forwarding Engine may reboot or Packet Forwarding Engine may notice next-hop corruption. PR1151844
If any linecard crashes early during ISSU warmboot, the CLI might report ISSU success, resulting in a "silent ISSU failure". PR1154638
In sampling feature, certain scenarios force handling of the sampled packet at the interrupt context , which may have chance to corrupt the BMEB packet context , and lead to BMEB FDB corruption. PR1156464
On MX Series with MPCs/MICs platforms with MPC2-NG/MPC3-NG/MPC3/MPC4/MPC5/MPC6 installed, in rare cases, a very rare hardware error - TSTATE Parity error might occur. It can cause FPC getting stuck, but it will not trigger the error-reporting infra (CMERROR). Fixes have now been provided. PR1156491
The default (per-packet load balancing) PPLB export policy created for Ethernet VPN (EVPN) has been removed from JUNOS. It was used to enable per packet load-balance for EVPN routes on certain MX platforms and not all. Now per-packet load balance needs to be configured explicitly. PR1162433
On Junos 15.1 and above, after Routing Engine switchover and both Routing Engine reboot, krt queue might get stuck. It's because: under this scenario, agentd creates it's table before rpd reading tables. But after rpd restarting and rebuilding tables, it could not filter an agentd's table out. It might cause slow route convergence or traffic loss. This issue would disappear automatically in 30 minutes. PR1162592
On MX Series router with services PIC (MS-DPC/MS-MPC/MS-MIC), the ICMP time exceeded error packet is not generated on an IPsec router on the decap side. PR1163472
When the MS-MIC or MS-MPC installed in MX Series router is processing traffic, and the IPsec policy configuration is changed by means of adding or upating a policy, mspmand process crash might occur. PR1166642
Sampled continues logging events in trace option file after trace option for sampled deactivated. This can be hit if there is no configuration under 'forwarding-options sampling' but other configuration for sampled is present (e.g. port-mirroring). PR1168666
When MS-MPC is used, if any bridging domain related configuration exists (e.g. "family bridge", "“vlan-bridge"”, "“family evpn", etc), in some cases, continuous MS-MPC crash hence traffic loss mighty occur. PR1169508
When using Periodic Packet Management process (PPMD, responsible for periodic transmission of packets on behalf of its various clients) related protocols (e.g. LFM, CFM, LACP, BFD, etc), during fabric or SIB online process, possibly, the client session (who establish adjacencies with PPMD to receive/send periodic packets on those adjacencies, such as LFM, CFM, LACP, etc) of PPMD may flap due to CPU hog issue. PR1174043
On Virtual Tunnel (vt) tunnel environment with forwarding-class, customer is using AE interface to terminate subscribers on the box and the AE interface has members on two different FPCs, due to a software defect, the mirrored traffic is not going to the correct forwarding class as expected. The issue is also seen when terminate Subscribers and vt tunnel hosted interface are on two different FPCs (Non-AE case). PR1174257
When using MS-MPC or MS-MIC service cards, a single pool cannot be used in different service-sets. Separate pools with different names would then need to be used. Additionally, pools created automatically by a source-prefix or destination-prefix statement will not work if the same source-prefix or destination-prefix statement appears in a different service-set. PR1175664
MTU discovery may not be working due to luck of VRF info on egress card for BBE Subscriber traffic. PR1177381
This is a display issue and doesn't affect functionality of the power, fixing has been added to commands 'show chassis power' and 'show chassis environment pem', when one of the DC PEM ciruit breaker tripped. PR1177536
CGNAT-NAT64: Few port leak are observed for the EIM/EIF IPv4 traffic(2M sessions) from public side. PR1177679
destination-prefix-list support list added for NAT rule with twice-napt-44 translation. Customer will be able to define a prefix list and match it in the NAT rule while using twice-napt-44. PR1177732
If "router-advertisement" protocol is configured in client ppp profile, unsolicited RA might be sent before the IPv6CP Configuration ACK is received. PR1179066
After One side PE Junos upgrade from the release before 15.R1 to the release after 15.1R1, due to the construction of es-import-target changed , type 4 routes are not imported and missed in table __default_evpn__.evpn.0, which caused both PEs thought itself is DF router and forwarding BUM frames.This will prevent to upgrade Junos in production network. PR1179443
On T-series platforms with 10x10GE Type 4 PIC installed, if an interface in such PIC is configured with WAN PHY mode, the CoS configuration on the port will be incorrectly programmed and it might result in unexpected packet drop. PR1179556
On dual Routing Engine platforms, if interface changes occur on Aggregate Ethernet (AE) which result in marking ARP routes as down on the AE (e.g. bringing down one of the member links), due to interface state pending operation issue on backup Routing Engine, in race condition, the backup Routing Engine may crash and reboot with an error message (panic:rnh_index_alloc: nhindex XXX could not be allocated err=X). PR1179732
In the CGNAT CLI show service alg conversations fails to display parent session status for ALG conversations. PR1181140
In case of point to point interfaces and unnumbered interfaces rpd crash might be seen in corner cases on configuration changes. There is potential fix given through this PR to avoid the crash. PR1181332
When "dynamic-tunnels" is configured with configuration statement "gre", performing Routing Engine switchover might result in rpd crash. PR1181986
Fragmented ALG control traffic is not supported on the MS-MPC. PR1182910
With NAT translation-type as napt-44, a few sessions are getting stuck upon deactivating/activating service-set or corresponding applications at a few times with traffic running. The same symptom is seen upon deactivating/activating service-set with traffic running and with 'deterministic-napt44' translation type as well. PR1183193
CGNAT Pool stats for "Available address" is shown incorrect for destination pool. Available address shown zero even though destination nat IPs are available PR1183538
With BGP add-path and consistent-hash enabled, when a BGP learnt route prefix with multiple paths(next-hop) is installed in the forwarding-table, all the next-hops should be reachable/resolvable at the time of installing the route in the forwarding-table. However, there might be a chance that any of the next-hops are not resolvable at that time, which will lead Packet Forwarding Engine's incorrect route programming. In this case, traffic forwarded to this prefix will be affected. PR1184504
When IPv4 firewall filter have 2625/32 destination in prefix-list , filter attached to subscriber interface is found broken. PR1184543
Starting with 15.1F5, the splitting of destination NAT pools across AMS members will be prevented. Currently with AMS interfaces, dnat44 pools do not get split. However, all twice-NAT destination pools are split. This is not needed and this change makes it so (source pools are split or/and hashing is based on source so there is never any chance of conflict). Please work with Francois to get details. PR1184749
Continuous reporting of the following messages might be noticed sometimes while bringing up all IFD/IFL/IFF states at once.
Apr 1 11:16:05 mx2020-1 dot1xd[16641]: %-: task_receive_packet_internal: knl Ifstate packet from zero-len socket 8 truncated. Apr 1 11:16:05 mx2020-1 dot1xd[16641]: %-: Free allocated bufp:(a433004) buflen:(16384)task_receive_packet_internal: knl Ifstate packet from zero-len socket 8 truncated. Apr 1 11:16:05 mx2020-1 dot1xd[16641]: %-: task_receive_packet_internal: knl Ifstate packet from zero-len socket 8 truncated. Apr 1 11:16:05 mx2020-1 dot1xd[16641]: %-: task_receive_packet_internal: knl Ifstate packet from zero-len socket 8 truncated. Apr 1 11:16:05 mx2020-1 dot1xd[16641]: %-: Free allocated bufp:(a433004) buflen:(16384)task_receive_packet_internal: knl Ifstate packet from zero-len socket 8 truncated. Apr 1 11:16:05 mx2020-1 dot1xd[16641]: %-: task_receive_packet_internal: knl Ifstate packet from zero-len socket 8 truncated.
During syncing of ifstate dot1xd, try to read all the Physical Interfaces/ifl/iff state at once. In scale scenario, the size of these information will be very high. It may exceed demon rlimit / memory availability.PR1184948
In IPv6 environment, adding a link local neigbour entry on subscriber interface then adding a new lo0 address, if delete this neighbour entry and the subscriber interface, due to software defect, the nexthop info is not cleaned properly, the rpd process might crash. The routing protocols are impacted and traffic disruption will be seen due to loss of routing information. PR1185482
When ams-interface is configured in warm-standby mode without adding any members, configuration commit will lead to rdd core. PR1185702
AMS redundant interfaces not listed under possible-completions of operational commands. PR1185710
In IPv6 environment with graceful Routing Engine switchover (GRES) enabled, when a new prefix (global address) is added on the donor interface (in this case, loopback interface), and then perform GRES, the ksyncd process crash might be observed due to kernel replication error. PR1186317
When both AMS-redundant interface and AMS-load-balancing interface is configured in the system, 'Not a deterministic nat pool' syslog is generated whenever deterministic-nat show cli command 'show services nat deterministic-nat nat-port-block' is executed. PR1186723
JUNOS might improperly bind Packet Forwarding Engine ukernel application sockets after ISSU due to a bug in IP->TNP fallback logic. Because of that bug, threads running on the ukernel that relay on UDP sockets can experience connectivity issues with host, which in turn can lead to various problems. For instance, sntp (simple network time protocol) client might fail to synchronize time, which in turn might lead to other problems such as failure in adjacency formation for HMAC authenticated protocols. PR1188087
By default SNMP will cache SNMP values for 5 seconds. Sometimes kernel will cache these values for longer duration. This PR will correct the caching behavior. PR1188116
The command "request system reboot both-routing-engines local' on VC-Mm will reboot only one Routing Engine on an MX-VC, with this fix, it will reboot both Routing Engines of local chassis. In addition, this fix also removes the "set virtual-chassis member <n> role line-card" configuration option on an MX-VC because this option is not supported on MX-VC as designed. PR1188383
On MX routers, a vulnerability in IPv6 processing has been discovered that may allow a specially crafted IPv6 Neighbor Discovery (ND) packet to be accepted by the router rather than discarded. The crafted packet, destined to the router, will then be processed by the Routing Engine (RE). A malicious network-based packet flood, sourced from beyond the local broadcast domain, can cause the Routing Engine CPU to spike, or cause the DDoS protection ARP protocol group policer to engage. When this happens, the DDoS policer may start dropping legitimate IPv6 neighbors as legitimate ND times out. Refer to JSA10749 for more information. PR1188939
Ingress queuing configuration on MPC2ENG is leading to host loopback wedge due to some bug in the code specific to MPC2ENG; there is a mis-programming in the Junos code for the lookup chip for this type of card. PR1189800
When polling an si-interface hosted on an NG-MPC Non-HQoS line card (MPC2E-3D-NG, MPC3E-3D-NG) ,there always has a 10 sec delay,which might break SNMP polling. PR1192080
As described in RFC7130, when LACP is used and considers the member link to be ready to forward traffic, the member link MUST NOT be used by the load balancer until all the micro-BFD sessions of the particular member link are in Up state. PR1192161
If a message received from LLDP neighbor contains "Port Id" TLV which has "Interface alias" subtype and is longer than 34 bytes, subsequent running of "show lldp neighbors" might lead to l2cpd crash. PR1192871
On MX series with MPC3/MPC4/MPC5/MPC6, the VSC8248 firmware on the MPC crashes occasionally. This PR enhances the existing VSC8248 PHY firmware crash detection and recovery, helping recover from a few corner cases where the existing JUNOS workaround does not work. PR1192914
When MoFRR activated, multicast source route flapping leads to corresponding multicast traffic 100% drop. PR1194730
On Junos OS Release 15.1R3 and later with Tomcat model BBE release, if a subscriber login/logout which using multicast service, then another subscriber login and also use multicast service, this may cause bbe-smgd core on backup Routing Engine. PR1195504
In inline BFD or distributed BFD (in Packet Forwarding Engine) scenario, Packet Forwarding Engine fast reroute is not invoked anymore if the remote peer signals BFD ADMINDOWN message to local node and convergence time is performed based on protocol signaling. PR1196243
On platforms running Junos OS with FreeBSD10, if tracing is enabled, due to the log file pointer not being handled correctly for log file rotation, the rpd process might crash when the log file rotates. PR1196318
Distributed BFD session using inline-redirection on MX-VC might not work if the ANCHOR Packet Forwarding Engine is not within the same chassis member as the interface where the BFD packet is received from peer device PR1197634
L2VPNs or L2Circuit services along with lengthy interfaces descriptions might lead to memory leak in variable-sized malloc block, which in turn results in RPD crash due to "out of memory". PR1198165
Problem: ======== The following continuous error messages are generated during 2X100GE CFP2 OTN MIC online on MX2K. This error message means PCI control signal communication failure between Packet Forwarding Engine on MPC6E and PMC Sierra OTN framer (pm544x) on MIC 2X100GE CFP2 OTN. *** messages *** Jul 25 17:39:04.807 2016 MX2K : %PFE-3: fpc0 cmic_pm544x_hires_periodic: error getting counters Jul 25 17:39:04.893 2016 MX2K : %PFE-3: fpc0 Failed in function pm544x_manage_link:2616 Jul 25 17:39:05.267 2016 MX2K : %PFE-3: fpc0 Failed in function pm544x_link_status:2449 Jul 25 17:39:05.267 2016 MX2K : %PFE-3: fpc0 cmic_pm544x_hires_periodic: error getting counters Jul 25 17:39:05.267 2016 MX2K : %PFE-3: fpc0 Failed in function pm544x_manage_link:2616 Jul 25 17:39:05.267 2016 MX2K : %PFE-3: fpc0 Failed in function pm544x_link_status:2449 Jul 25 17:39:05.321 2016 MX2K : %PFE-3: fpc0 cmic_pm544x_hires_periodic: error getting counters Jul 25 17:39:05.408 2016 MX2K : %PFE-3: fpc0 Failed in function pm544x_manage_link:2616 Jul 25 17:39:05.486 2016 MX2K : %PFE-3: fpc0 Failed in function pm544x_link_status:2449 Root cause: =========== Bug was in converting the 32bit PCI shared address to 64 bit address. When the MSB of the 32bit address was set, the conversion was buggy as it type caused it to signed long int, which resulted in extending the sign bit to first 32 bits of the converted 64bit address. The first 32bit of the converted address is expected to be zero as our memory is only 32 bit addressable. Problem appearance on customer deployments: =========================================== 1. Issue will be seen only when there are large number of nexthops in the Packet Forwarding Engine due to Packet Forwarding Engine anchor feature before the MIC is made online. 2. If the MIC came online without hitting this issue, then there is no chance of hitting this issue later. Because the bug was in the PCI shared memory allocation, which happens only during the MIC online. 3. This issue started showing after the Packet Forwarding Engine anchoring feature, which delayed the MIC online until the next-hops are sync to Packet Forwarding Engine. As a result the MIC is coming online very late and the shared memory allocation is coming from the higher RAM address, which the PMC vendor code porting layer is failing to handle. After the fix from this PR, we should not hit this issue. PR1198295
With MPC-NG or MPC5E hardware, the range of the queue weights on an interface is from 0 to 124. As every queue has to have an integer value of queue weight, it might be impossible to assign the weights in exact proportions to the configured transmit-rate percentage. Therefore, when a physical interface operates in a PIR-only mode, this might cause imprecise scheduling results. PR1200013
On MX Series platforms, the mspmand process might crash on the MS-MPC with XLP B2 chip (e.g.REV17). The exact trigger is unknown. It is usually seen with 70% to 90+% CPU load conditions. PR1200149
GUMEM errors for the same address may continually be logged if a parity error occurs in a locked location in GUMEM. These messages should not be impacting. The Parity error in the locked location can be cleared by rebooting the FPC. PR1200503
MS-MPC/MS-MIC: MSPMAND generates core files when an encrypted packet is received out of the range of replay-window size. The issue might occur in peak loads where by encrypted packets received, out of order due to drops in the network. PR1200739
Dynamic firewall filter programs incorrect match prefix on the Packet Forwarding Engine PR1204291
Packet Forwarding Engine may install next-hop incorrectly and cause traffic loss, if there is a next-hop policy pointing to a IPv6 address which need to be resolved. PR1204653
If send upstream and downstream IPv4+IPv6 traffic for PPPoE subscribers, mirrored traffic loss would be seen. PR1204804
VC link "last flapped" timestamp is reset to "Never" on the new backup Routing Engine after MX VC global GRES switchover. PR1208294
The cpcdd daemon might core and restart on the subscriber scenario with CPCD (captive-portal-content-delivery) service configured. PR1208577
On MX Series running Tomcat release, if route-suppression is configured for access/access-internal routes as well as destination L2 address suppression is configured for the subscriber, bogus destination MAC would be generated for the subscriber. PR1209430
The logic to calculate the IPsec phase2 soft lifetime has been changed in 14.2R6, resulting in an interop issue in certain scenarios. A hidden configuration statement is provided as part of this PR which will revert the soft lifetime logic to the one used in 11.4 release. PR1209883
BGP PIC Installs multiple MPLS LSP next hops as Active instead of Standby in Packet Forwarding Engine. This can cause a routing loop. PR1209907
On MX series routers, when configuring the dynamic access routes for subscribers based on the Framed-Route RADIUS attribute, the route will be created on the device, however, the it will be installed as an access-internal route instead of access route if it has /32 mask length. PR1211281
Inline Jflow - Sequence number in flow data template is always set to zero on MPC5E and above line card type PR1211520
On T-series platforms, if interfaces from FPC Type 4 and FPC TYPE 5 are configured together in one VPLS routing instance, incorrect TTL might be seen when packets go through the VPLS domain, for example, packets received via one FPC TYPE 4 might be forwarded to other FPC type 4 with incorrect TTL. The incorrect TTL could cause serious VRRP issue. When VRRP is enabled, after one CE sends the VRRP advertise packets with TTL value 255, other CE might receive the VRRP packet with TTL value 0 and therefor discard these VRRP packets. As a result, the VRRP status in both CE becomes Master/Master. PR1212796
The MS-MPC/MS-MIC service cards can encounter a core when using certain ALGs or the EIM/EIF feature due to a bad mapping in memory. PR1213161
When FPC Type 5 - 3D cards run into over-temperature condition, in T4000 router. It is possible that under certain circumstances: - chassisd will declare the over-temperature condition and by default the router will shut down in 240 minutes. - Over-temperature SNMP trap (jnxOverTemp) are not sent to external NMS. PR1213591
MX-VC: All VCP interface experiences tail-dropped as result of configuration conflict. It is a good idea to reference documentation and customize the COS associated with VCP interfaces. In this scenario customer has configured a corresponding xe-n/n/n interface with just a description to denote that port is dedicated to VCP. Problem is the resource calculation is impacted and reports smaller queue-depth maximum values when both network interface xe-n/n/n and vcp-n/n/n are defined. Issue is more likely to occur with dynamic modification add/delete of vcp interfaces with a corresponding network interface xe-n/n/n configured. > show interfaces queue vcp-5/3/0 | match max Maximum : 32768 Maximum : 32768 Maximum : 32768 Maximum : 32768 PR1215108
If zero length interface name comes in the SDB database, on detection of a zero length memory allocation in the SDB database, a forced rpd crash would be seen. PR1215438
On Junos OS Release 15.1R3 and later MX Series platform release, if DHCPv4 or DHCPv6 subscriber is configured and the subscriber joins more than 29 multicast groups, the line card might crash. PR1215729
Incorrect source MAC used for PPPoE after underlying AE is changed PR1215870
Prior to this fix for LI releases, parameterized family inet filter with term matching on address with non-contiguous mask will result in CLI syntax error which would fail subscriber login or CoA requests. PR1215909
The AMS interface is configured in warm-standby mode when fail-over occurs a percentage of the traffic might fail to get NAT. The issue is after the failover the internal mappings driving traffic back to the service PIC might fail. PR1216030
If RS/RA messages were received through an ICL-enabled(MC-AE) IFL, packet loss would be seen and last for a while. PR1219569
The bbe-smgd core occurred in bbe_autoconf_if_l2_input when DHCP client generates ARP. PR1220193
During CoA request there are no changes on schedulers. Requests are received successfully, but no changes from CoS side. PR1222553
Due to a defect related to auto-negotiation in a Packet Forwarding Engine driver, making any configuration change to interface in MIC "3D 20x 1GE(LAN)-E,SFP" might lead to interface flapping. PR1222658
High Availability (HA) and Resiliency
In PPP environment with access-internal and multiple routing instances, after restart rpd process, the access-internal route might disappear. PR1174171
Backup routing engine might restart unexpectedly due to memory leak after switchover. PR1198005
Infrastructure
With 13.3 releases using Ericsson/ Juniper EPG platforms, some session PIC C-PIC cards might experience some race condition resulting into kernel vmcores, following by reboot (failover to spare C-PICs) due to soft-update BSD enabled in some partitions of the Routing-Engine. The Softdeps on freebsd is not used any longer in freebsd6 where the fix includes disabling it on all Junos OS partitions. PR1174607
From Junos OS Release 15.1 and later, smart error message of Unigen SSD may be seen. Smartd reads SSD attributes and checks on 197-current-uncorrectable, 198-offline-uncorrectable by default. To Unigen, 198 is not = Offline-Uncorrectable, it is 'Total Count of Read Sectors'. As it is Total-Read, such attribute(198) always carries value and smartd reports it as 'Offline Uncorrectable Error'. PR1187389
The statistics info of em0 is 0 when checking by SNMP or CLI show command. PR1188103
Interfaces and Chassis
In a VPLS scenario the flood NH for the default mesh group might not be programmed properly. A complete black-holing for the VPLS instance would be seen as a consequence. PR1166960
In previous release, only IEEE classification is supported for CFM OAM packets. In the fix, we will support 802.1AD based filter for CFM OAM packets. when Linktrace and loopback requests are received in MX, 802.1p bits is used to determine the forwarding class and queue for response or linktrace request forwarded to next router, this cause these PDUs are put to wrong queue when input-vlan-map pop is present because received PDU doesn't carry 802.1p bits. In the fix, we will use incoming forwarding class to determine the 802.1p priority and outgoing forwarding class and queue for new generated response or link trace requests. PR1175951
On dual Routing Engine system, if master Routing Engine is running Junos OS 13.3R9/14.1R7/14.2R5/15.1R3/15.2IB or later, backup Routing Engine is running Junos OS prior to 13.3R9/14.1R7/14.2R5/15.1R3/15.2IB, a major alarm is raised. This is cosmetic and can be safely ignored. Please upgrade backup Routing Engine to the same release with master Routing Engine to avoid the issue.
user@router> show system alarms 2 alarms currently active Alarm time Class Description 2016-xx-xx xx:xx:xx UTC Major PEM 1 Not OK 2016-yy-yy yy:yy:yy UTC Major Host 1 failed to mount /var off HDD, emergency /var created <<<<<<<<<<<<<<< PR1177571
In the hsl2 toolkit, there is a process which periodically checks the ASICs which communicate through it. Due to a bug in the toolkit code, the process used devalidate the very ASIC that it used to process, due to which the crash happens. PR1180010
When there is a configuration change about OAM CFM, cfmd memory leak is observed and sometime also might trigger cfmd crash info as follows. Following messages are observed: /kernel: Process (44128,cfmd) has exceeded 85% of RLIMIT_DATA: used 378212 KB Max 393216 KB PR1186694
The jpppd might crash with a core dump due to memory heap violation associated with processing MLPPP requests PR1187558
If "filter" configuration statement is present in PPPoE traceoptions configuration, the resulting log file will contain only part of messages about establishment of the interesting PPPoE session, but will contain information related to other sessions established at the moment PR1187845
SLR's/DMR's are not getting classfied to Forwarding Class when CCM configured on AE with member links from NG MPC card. PR1189254
In OAM CFM (connectivity-fault-management) scenario on AE interfaces with maintenance-domain level (for example: 3) configuration, when sending OAM CFM LBM messages with level which is smaller than configured level to ingress interface of VPWS with QinQ encapsulation, they are not dropped by ingress PE. PR1191818
MAC addresses are incorrectly assigned to interfaces by the MX-VC SCC (global) chassisd daemon, leading to duplicate addresses for adjacent FPCs. PR1202022
A CFMD core will be generated upon commit if the following conditions are met: * CFM is configured * On mis-configuration of icc format for MA (e.g. ICC name-format does not start with a character) PR1202464
For the duration of GRES, if an async message for RTTABLE is received at DCD during initialization, it might result in unexpected state changes, the traffic forwarding might be affected. This is a timing issue, it is hard to reproduce. PR1203887
When configuring "vlan-tags" for any interface, if the interface configuration is changed continually, the dcd process might memory leak. If the memory is exhausted, the dcd process might crash. PR1207233
When VRRP is configured on IRB interface with scaling configuration (300k lines), in corner case, handles might not be released appropriately after their use is over. As a result of that, memory leak on vrrpd might be seen after configuration commit. PR1208038
Access-internal route not installed for Dual Stack subscriber terminated in VRF at LNS with on-demand-ip-address PR1214337
During L2TP session establishment on MX LAC, if CPE attempts to negotiate MRU higher than 1492 bytes, spurious MRU of 1492 bytes is included into the Last Received ConfReq AVP in ICCN packet. PR1215062
In ppp subscriber scenario, if jpppd process receives the reply message from radius/tacplus server which has character of %, it might cause jpppd to crash. PR1216169
Layer 2 Ethernet Services
In DHCP environment, if interface is deleted and recreated in single commit, the duplicate DHCP subscriber is not getting bound. PR1188026
If a client sends a DHCP Request packet, and Option 55 includes PAD option (0), a DHCP ACK will not be sent back to the client. PR1201413
In dhcp relay environment, when delay-authentication and proxy mode are configured at same time. Jdhcpd may core due to NULL session ID. PR1219958
Multiprotocol Label Switching (MPLS)
In the following scenario where 1) The PHOP link goes down and the router becomes MP for a LSP. 2) After some time, NHOP link for the same LSP goes down. The router becomes PLR for the same LSP. So effectively, the router is both MP and PLR for the same LSP. In this scenario, the router sends incorrect PathErr message for the backup MP PSB. It sends "Bad strict route" PathErr instead of "Tunnel local repaired" PathErr. PR1132641
Due to Junos OS Release 15.1 enabling process rpcbind in FreeBSD by default, port 646 might be grabbed by rpcbind on startup, which causes LDP sessions failing to come up. PR1167786
RSVP signalled p2mp sub-LSP with atleast 1 or more sub-LSPs in a down state might not get re-optimized in the event of a transit core link going down. If there are no sub-LSPs in a down state at the time of re-optimization then this issue won't be seen. This can cause traffic drop over the sub-LSP which are carrying traffic which are unable to get re-optimized. This PR addresses this issue. PR1174679
On Juniper devices with "link-protection" configured and with/without "optimize-adaptive-teardown p2p" configured, rpd might crash after link flap. PR1186003
With a high degree of aggregation and a large number of next hops for the same route, ldp may spend too much CPU updating routes due to topology changes. This may result in scheduler slip and ldp session timing out. PR1192950
Packets will be out-of-order if they are Router Engine(RE) generated and go over unilist/ECMP. PR1193697
Changing the configuration under both [ protocols pcep ] and [ protocols mpls lsp-external-controller ] might trigger rpd to crash due to a race condition. PR1194068
If LDP neighbor relationship is over unnumbered interface, then flapping interface, the LDP will fail to advertise label binding. PR1202071
With two Routing Engines and ldp export policy or l2-smart-policy configured. rpd on the backup Routing Engine may crash when ldp is trying to delete a filtered label binding. PR1211194
Network Management and Monitoring
A trailing newline was erroneously added to the $$.message variable, this had undesirable effects for some use cases when using the 'event-options policy <> then execute-commands commands <>' stanza. The fix escapes any newline chars which mitigates the issue. PR1200820
Platform and Infrastructure
If IGMP snooping is enabled in a routing-instance (RI), in a very rare condition, the IGMP packets received in this RI might get dropped by firewall filter configured on loopback interface in master instance, which leads to multicast blackholing. PR1092494
Preventing an issue where one could end up with two <Junos: comment> entries under the [interfaces] stanza. PR1102086
In software versions which contain PR 1136360's code changes on MX-VC systems, when J-Flow is not configured and equal-cost multipath (ECMP) load-balanced routes occur, the linecards may stop forwarding packets after logging any of the below errors prior to possible linecard restart or offline:
- PPE Thread Timeout Trap. - PPE Sync XTXN Err Trap. - Uninitialized EDMEM Read Error. - LUCHIP FATAL ERROR. - pio_read_u64() failed.
(A possible workaround is to configure J-Flow and restart all linecards.)
In software versions which do not contain PR 1136360 solution, on MX Series Virtual Chassis (MX-VC) with "virtual-chassis locality-bias" configured, when equal-cost multipath (ECMP) load-balancing is occurring in the VC system, multicast streams and flooded Layer 2 streams may be duplicated or lost. Disabling "virtual-chassis locality-bias" from the configuration will eliminate the problem. PR1104096
Configuring one group with configuration of routing-instances and applying this group under routing-instances, then the rpd process will crash after executing "deactivating/activating routing-instances" commands. As a workaround, you can avoid using "apply-groups" under routing-instances hierarchy. PR1109924
On MX Series platform with MPC6E linecard, MPC6 only has 2 PICs (PIC number 0/1), if we try to configure an si interface with PIC number beyond range (PIC number 2) on MPC6E, it might crash, and traffic forwarding might be affected. PR1160367
In CoS environment with shaping-rate configuration under interface, if flapping that CoS interface, the shaping-rate function does not take effect. As a workaround, please deactivate/activate interfaces to avoid the issue. PR1163147
Because of an internal timer referring Time in Unix epoch (UNIX epoch January 1, 1970 00:00:00 UTC) value getting wrapped around for every 49 days, flows might get stuck for more than the period of active/inactive time out period. The number of flows that get stuck and how long they get stuck can not be deterministic exactly, which depends on the number of flows at the time of timer wrapping around. PR1173710
This PR fixes an FD (file descriptor) leak problem in MGD process when netconf traceoptions are set. If <commit> rpc is executed via netconf session, there is an FD leak in the corresponding MGD pid. PR1174696
On MX2020/2010, chassisd file rotation on commit check will cause the trace file to be stuck and no other operational chassisd events will be logged until chassisd restart. PR1177625
When graceful Routing Engine switchover (GRES) is configured, the ksyncd crashes on backup Routing Engine (RE) if a VPN static route has a network address as a next-hop. This causes that the backup Routing Engine is not ready for graceful switchover. PR1179192
The issue happens after GRES. If commit on the new master during the config sync from the old master, commit might fail. PR1179324
In IPv6 sampling environment, if flapping IPv6 routes frequently, in rare condition, due to a software defect, free of route node is not deleting it from radix node, so the Packet Forwarding Engine might crash. This is a corner case, it is hard to reproduce. PR1179776
On MX platform with LU chipset such as MPC1/ MPC2/ MPC3E/ MPC4E/MPC 3D 16x10GE or T platform with FPC type 5, if one interface is applied COS schedulers with transmit-rate percent and rate-limit parameter, then for pseudowire traffic, the traffic transmit-rate percent is not correct. PR1180427
If igmp snooping is configured in a VPLS routing instance and the VPLS instance has no active physical interfaces, multicast traffic arriving from the core might be send to the Routing Engine. As a result, host queues might get congested and it might cause protocol instability. As a workaround, configure a dummy activate interface in the VPLS routing instance can avoid this issue. PR1183382
On MX2K, the 'commit full' operation, or committing configuration under 'system' stanza(such as root-authentication and fxp0 interfaces) can cause transient Fan check Major alarm and Fan full speed. The Fan Tray spins at full speed for a while, then goes back to normal with clearing the alarm. The Fan check alarm and corresponding snmp trap are temporal, and they can be safely ignored.
user@MX2K> show chassis alarms 2 alarms currently active Alarm time Class Description 2016-05-17 19:49:57 JST Major Fan Tray X Failure 2016-05-17 19:49:57 JST Major Fan Tray Y Failure usr@MX2K> show chassis environment Class Item Status Measurement Fans Fan Tray X Fan 1 Check Fan Tray X Fan 2 Check Fan Tray X Fan 3 Check Fan Tray X Fan 4 Check Fan Tray X Fan 5 Check Fan Tray X Fan 6 Check Fan Tray Y Fan 1 Check Fan Tray Y Fan 2 Check Fan Tray Y Fan 3 Check Fan Tray Y Fan 4 Check Fan Tray Y Fan 5 Check Fan Tray Y Fan 6 Check
When MPC9E is installed in MX2K, the Fans usually keep around 6K rpm, and the fan speed control is frequently done by the Junos OS software. In this situation, when all daemons are re-evaluated(by commit full or config change under system stanza), the software bug causes the fan status to be checked within quite small period, then the Junos OS software recognizes that the fan is faulty because the fan speed has not reached the target speed yet when the fan status is checked within the small period. After the fan alarm is detected, the fans are expected to start working with full speed to cool the system components.
The fan status check logic is fixed by this PR. The fan status is checked after the fan speed is stabilized, hence we do not see this transient fan alarm. PR1185304
In a very rare scenario, during TAC accounting configuration change, auditd daemon crashes due to a race condition between auditd and its sigalarm handler. PR1191527
VPLS: FPC CPU goes high for several minutes when mac/arp are learnt via lsi interfaces. The FPC CPU goes high during the learning phase and issue can be seen with various triggers that result in mac/arp re-learning e.g. mac flush, FPC reboot or link flap resulting in mac flush etc. For agent smith cards (MPC 3D 16x 10GE), the CPU may remain high for upto 30 minutes on learning/re-learning of 10k arp/mac via irb lsi interfaces Problem is only seen if there are ARPs learnt in bulk over irb lsi interfaces. PR1192338
Insertion of an offlined MPC6E into the MX2K chassis can cause the FPC Temp sensor to detect transient "WARM TEMP" condition, and the chassis FAN in the same zone goes to high speed.
*** messages ***
Jul 12 18:10:17.698 MX2K-re0 chassisd[xxxx]: CHASSISD_SNMP_TRAP7: SNMP trap generated: FRU insertion (jnxFruContentsIndex 7, jnxFruL1Index 3, jnxFruL2Index 0, jnxFruL3Index 0, jnxFruName FPC: MPC6E 3D @ 2/*/*, jnxFruType 3, jnxFruSlot 2) MX2K-re0> show chassis zones |refresh 2 ---(refreshed at 2016-07-12 18:10:18 JST)--- ZONE 0 Status Driving FRU FPC 2 Temperature 63 degrees C / 145 degrees F Condition WARM TEMP <------------------------ Warm temp is detected Num Fans Missing 0 Num Fans Failed 0 Fan Duty Cycle 27 ZONE 1 Status Driving FRU SFB 5 SFB-XF2-Zone1 Temperature 59 degrees C / 138 degrees F Condition OK Num Fans Missing 0 Num Fans Failed 0 Fan Duty Cycle 27 ---(refreshed at 2016-07-12 18:10:20 JST)--- ZONE 0 Status Driving FRU FPC 2 Temperature 63 degrees C / 145 degrees F Condition WARM TEMP <------------------------ Warm temp is detected Num Fans Missing 0 Num Fans Failed 0 Fan Duty Cycle 27 ZONE 1 Status Driving FRU SFB 5 SFB-XF2-Zone1 Temperature 59 degrees C / 138 degrees F Condition OK Num Fans Missing 0 Num Fans Failed 0 Fan Duty Cycle 27 ---(refreshed at 2016-07-12 18:10:22 JST)--- ZONE 0 Status Driving FRU FPC 2 Temperature 63 degrees C / 145 degrees F Condition OK Num Fans Missing 0 Num Fans Failed 0 Fan Duty Cycle 27 ZONE 1 Status Driving FRU SFB 5 SFB-XF2-Zone1 Temperature 59 degrees C / 138 degrees F Condition OK Num Fans Missing 0 Num Fans Failed 0 Fan Duty Cycle 27
Jul 12 18:10:27.489 MX2K-re0 chassisd[xxxx]: Fan Tray 0: zone 0 fan_speed current 27% target 50% raising ratio 0.80 (linear) FPC 2 temp 72 last 72 WTC 55 WT 60 high limit 75 i2c_ratio 0.80 Jul 12 18:10:27.490 MX2K-re0 chassisd[xxxx]: Fan Tray 0: set fan_speed to 50% cfg_speed 50% (linear) Jul 12 18:10:27.492 MX2K-re0 chassisd[xxxx]: Fan Tray 1: zone 0 fan_speed current 27% target 50% raising ratio 0.80 (linear) FPC 2 temp 72 last 72 WTC 55 WT 60 high limit 75 i2c_ratio 0.80 Jul 12 18:10:27.492 MX2K-re0 chassisd[xxxx]: Fan Tray 1: set fan_speed to 50% cfg_speed 50% (linear) Jul 12 18:10:47.517 MX2K-re0 chassisd[xxxx]: Fan Tray 0: zone 0 fan_speed current 50% target 27% falling ratio 0.00 (linear) SFB 2 SFB-XF0-Zone0 temp 63 last 63 WTC 70 WT 75 high limit 90 i2c_ratio -0.60 Jul 12 18:10:47.517 MX2K-re0 chassisd[xxxx]: Fan Tray 0: set fan_speed to 27% cfg_speed 27% (linear) Jul 12 18:10:47.519 MX2K-re0 chassisd[xxxx]: Fan Tray 1: zone 0 fan_speed current 50% target 27% falling ratio 0.00 (linear) SFB 2 SFB-XF0-Zone0 temp 63 last 63 WTC 70 WT 75 high limit 90 i2c_ratio -0.60 Jul 12 18:10:47.520 MX2K-re0 chassisd[xxxx]: Fan Tray 1: set fan_speed to 27% cfg_speed 27% (linear) PR1193273
A rare VMCORE can occur caused due to process limit being breached by too many RSHD children processes being created PR1193792
After system boot up or after PSM reset we may see "PSM INP1 circuit Failure" error message PR1203005
When a Netconf <get route information> RPC is executed for all routes via ssh transport session and the session is terminated before all the route information is retrieved, the MGD process and RPD daemon will cause high CPU utilization for an extended period of time. Example of issues caused by this high CPU utilization for an extended period is as follow: BGP neighbors holddown timer expires and become ACTIVE OSPF adjacencies reset during database exchange OSPF LSA retransmissions events on neighboring nodes due to missing ACKs LDP sessions time out non distributed BFD sessions being reset due to missing keepalives PR1203612
If Inline JFlow is configured in scaled scenarios, Inline JFlow Sampler route database is taking huge time to converge. PR1206061
When "commit confirmed" is used after performing some changes, and an empty commit is performed to confirm the changes, the previous changes related processes will be notified again which is unnecessary. It might cause session/protocol flap. PR1208230
If a Unicast or Multicast source sends a fragmented packet (a packet which exceeds the MTU of its outgoing interface) to the router and it needs to resolve the destination route, then only the first fragment of the packet is sent when the route it resolved. PR1212191
On MX2K, MIC output is seen when there is no MIC in MPC under "show chassis hardware detail".
Steps to reproduce the issue: 1. offline MPC 2. physically remove MPC 3. physically remove MIC from the MPC 4. reinsert MPC 5. online MPC
usr@MX2K> show chassis hardware detail |find fpc FPC 0 REV 68 750-044130 ABDA1879 MPC6E 3D CPU REV 12 711-045719 ABDA1735 RMPC PMB MIC 0 REV 14 750-049457 ABCY5322 2X100GE CFP2 OTN >>>>>>>> No MIC inside MIC 1 REV 26 750-046532 ABCZ3853 24X10GE SFPP >>>>>>>>>>No MIC inside XLM 0 REV 13 711-046638 ABDA1859 MPC6E XL XLM 1 REV 13 711-046638 ABDA1787 MPC6E XL PR1216413
This rmopd core was caused by the NULL pointer in SW function. PR1217140
When any MPC line card is offlined, it goes offline via all offline flows and connection is cleaned, but in the end of the offline flow, somehow it delays powering off the line card. The chasd powers off the MPC via I2cs write the respective power registers, but in hardware it is not really powering off. As a consequence, since MPC is still power-on but connection is down, it will try to reconnect, then start to come up automatically within 10 secs. It occurs sometimes (not all the times). PR1222071
Routing Policy and Firewall Filters
With rib-groups configured for importing routing information to multiple routing tables, unexpected route refresh might happen when committing configuration change, due to a defect in code related to secondary table list handling. PR1201644
From Junos OS Release 15.1, memory leak on policy_object might be observed if the configuration of policies is added and deleted in high frequency. Not all polices make memory leak, and only the container policy referred in policy statement hits this issue: the "from" in policy invokes the terms which is defined in policy-options, e.g. community, as-path, prefix-list. This is the configuration example. set policy-options prefix-list pl set policy-options policy-statement from prefix-list pl PR1202297
Routing Protocols
Junos OS exhibits two different next-hop advertisement behaviors for MP_REACH_NLRI on a multi-hop eBGP session, based on whether it is loopback peering or physical interface peering. When the routers are peering on their loopback, only the global IP of the interface (lo0) is advertised, whereas when the routers are peering through the physical interface, both global and link-local address are advertised as the NHs. PR1115097
When BGP speaker has multiple peers configured in a BGP group and when it receives the route from a peer and re-advertises route to another peer within the same group, MIB object "jnxBgpM2PrefixOutPrefixes" to the peers in the same group reports the total number of advertised prefixes in the group. MIB value "jnxBgpM2PrefixOutPrefixes" is defined as per peer basis but it looks as if it is per group basis. As a workaround, we can get the number of advertised prefixes from CLI command "show bgp neighbor" instead. PR1116382
When Bidirectional Forwarding Detection (BFD) is configured, after changing the MTU (between 1514 and 9192) of physical interface (IFD) where the BFD session is located, 2 issues might be seen as below. Issue 1: after link flapping, the BFD session may not come up due to incorrect mapping. Issue 2: there might be stale BFD sessions. This issue may also be seen when changing the interval from aggressive to a very less aggressive interval (e.g. change to 2 sec). PR1116666
On Junos OS based products, changes in routing-instance, like changing route-distinguisher or routing-option changes in some corner cases might lead to rpd crash. As a workaround always deactivate routing-instance part that is to be changed before committing the changes. PR1134511
When we have a route received from different eBGP neighbors, for this specific route, if all BGP selection criteria is matching, we will end up using router ID. As this is eBGP route, so BGP will use active route as the preferred one. Now if this specific route flapped with sequence from the non-preferred to the preferred path, RPD will run the path selection. During RPD path selection we might generate a core file. This issue has no operational impact, also a workaround is available to avoid this issue. PR1180307
Please refer to the following topology. If the opposite Router's interface "A" is down by "disable/deactivate/delete" configuration, BFD timeout detection might be long delay. Topology +-----+ | DUT | OSPF | |-------------+ +-----+ | A | | | | | | | +------+ OSPF(p2p) | | R2 | bfd | | | | | +------+ | | | V intf A | | +-----+ | | R1 |-------------+ | | OSPF +-----+ PR1183353
If we have post-policy BMP configured & import policy rejects the route making it hidden, we will still periodically send this Unreachable Prefix to the BMP station.
May 17 15:45:05.047931 bmp_send_rm_msg called, found post-policy prefix 101.66.66.66/32, peer 10.0.1.1 (External AS 65101), station BMP_STATION_2. May 17 15:45:05.047943 import policy rejected post-policy prefix 101.66.66.66/32, peer 10.0.1.1 (External AS 65101), station BMP_STATION_2. May 17 15:45:05.047986 generating post-policy delete for prefix 101.66.66.66/32, peer 10.0.1.1 (External AS 65101), station BMP_STATION_2. May 17 15:45:05.048001 BMP: type 0 (RM), len 76, ver 3, post-policy, for Peer 10.0.1.1, station BMP_STATION_2. May 17 15:45:05.048018 Peer AS: 65101 Peer BGP Id: 10.0.1.1 Time: 1463492684:0 (May 17 13:44:44) May 17 15:45:05.048027 Update: message type 2 (Update) length 28. May 17 15:45:05.048034 Update: Unreachable prefix data length 5. May 17 15:45:05.048047 Update: 101.66.66.66/32 PR1184344
Any configuration change can cause deletion of a firewall filter created for a routing instance if the flowspec routes in that instance are imported using rib-group, and there is no "inet-vpn flow" address family configured and the routing instance does not have any BGP group configured with "inet flow" address family. PR1185954
On the RSVP LSP scenario with ISIS TE configured, memory leak might happen in rpd and Packet Forwarding Engine after the LSP re-optimization, and this migth cause FPC crash. PR1187395
The rpd might crash when printing the socket address of type inet6 flow address family while the buffer is not sufficient to print decimal number. PR1188502
Multicast routing table displays inconsistent MoFRR state after activating/deactivating MoFRR. This is a cosmetic issue and has no impact on traffic. PR1194729
On executing "show task replication" command, IS-IS could be shown as "Complete" if IS-IS is not configured on the device. If IS-IS is configured, the replication will be shown correctly (NotStarted/InProgress/Complete). No other functionality impacted. PR1199596
The VRF related routes which are leaked to the global inet.0 table and advertised by the access routers are not being advertised to global inet.0 table on the core. PR1200883
With nonstop-routing (NSR) enabled, all running protocols include PIM and NG-MVPN will be replicated, if NSR is disabled only under PIM "set protocol pim nonstop-routing disabled", this will remove both PIM and NG-MPVN from replicated list, then adding PIM NSR again by "delete protocol pim nonstop-routing disabled" will not work as expected and PIM will not be added. PR1203943
In a situation which a BGP route is resolved using a secondary OSPF route which is exported from one routing-instance to another routing-instance. If the BGP route is being withdrawn while the OSPF route is deleted, rpd might restart unexpectedly. PR1206640
BGP routes are rejected as cluster ID loop prevention check fails due to a mis-configuration. But when the mis-configuration is removed BGP routes are not refreshed. The fix of this issue will send a soft route refresh dynamically when a cluster ID is deleted. PR1211065
If a NSR enabled router is providing graceful restart support for a restarting peer, and the standby is unconfigured, then rpd may core on the standby during the shutdown. PR1212683
EBGP peer may remain "Idle" at NSR backup-Routing Engine, after Interface-down event PR1215855
Services Applications
On MX platform, when using MS-MPC, the "idpd_err.date" error message is filling var/log. Please refer to KB30743 for details. PR1151945
During "commit synchronize" operation, when commit gets executed on backup Routing Engine, system is idling for 10 seconds after the following operation (can be observed with "commit synchronize | display detail"): 2016-07-07 10:30:04 CEST: Spawning IPSec Key Management daemon to check new configuration This slows down the whole commit process exactly by 10 seconds. Issue can only be seen when IPSec is configured and, therefore, IPSec Key Management daemon (kmd) is running (needed by configuration). PR1185504
When using MS-DPC under heavy load condition (e.g. with about 7m flows) with deterministic NAT and port block allocation (PBA) scenario, in rare condition, MS-DPC crash may occur due to memory issue. PR1186391
Attempting to ping a subscriber address from the L2TP LNS CLI will fail. PR1187449
Issue happens in specific corner cases and Acceptable workaround is available. If we bring down the complete subscriber and bring it back up again. Family bring up will work. PR1190939
When using NAT on the MX Series the FTP ALG fails to translate the PORT command when the FTP client using Active Mode requests AUTH(SSL-TLS) and the FTP server does not use AUTH PR1194510
When MS-PIC is running on T640/T1600/T4000, the number of maximum service sets is wrongly limited to 4000, instead of 12000. This might impact in scaled service (IPsec, IDS, NAT, Stateful firewall filter, etc) environment. PR1195088
After upgrading M series router (LNS) to 15.1R4.6, it was observed that L2TP sessions are not coming up due to PPP CHAP authentication failure. L2TP control messages are sent/received and tunnel id is obtained. PPP LCP is also successful. During PPP CHAP phase only Challenge and Response messages are present and then L2TP CDN is initiated. PR1201733
When configuring Network Address Translation (NAT) service, the service route is still available in route table even after disabling service interface. Any types of service interfaces (except ams- interface) that supports NAT might be affected. PR1203147
On MX series with L2TP configured, for some reason the L2TP packet in ICRQ retransmission message is set to incorrect value, and this causes frequent L2TP session flaps. PR1206542
On MX Series routers with subscriber management feature enabled used as a LAC (L2TP Access Concentrator), a small amount of memory leak is leaked by jl2tpd process on the backup Routing Engine when subscriber sessions are logged out. PR1208111
Subscriber Access Management
In DHCP relay scenario, DHCP relay binding might get stuck in "RELEASE(RELAY_STATE_WAIT_AUTH_REQ_RELEASE" state due to the LOGOUT Request is not processed correctly by authentication manager process (authd) if there were multiple attempts to activate Lawful Intercept (LI) for this DHCP subscriber using RADIUS change of authorization (CoA) packets in quick succession. PR1179199
If aborting "test aaa ppp" command with Ctrl-C, due to a software defect, when subscriber logout, the system does not wait for logout response, subscriber is immediately removed. Because of this, dfwd daemon is not able to clear filters in time and results in stale entries. The stale info might affect subscriber login and logout. PR1180352
In the event, such as JSRC re-sending a PPR with a policy-install for an already installed policy or policy-remove for a non-existing policy (resulting if the SRC goes down after issue the PPR but before receiving or preserving the response), the outcome of the processing is to “do-nothing” which results in a different code path.PR1189020
On EX2200/EX3300 series switches configured dhcp-local-server, it brings up a few (say 6 or more) or all interfaces which is under dhcp-local-server hierarchy at once then the authd process continually core dumps causing the switch get in stuck and resulting in packet drop. PR1191446
When destination-override is used(root@user# set system tracing destination-override syslog host <host ip>), the userAccess events are not sent to the external syslog server. PR1192160
On MX series platform, when using RADIUS dynamic requests for subscriber access management, if the device detects that the CoA-Request it received is same with the one in processing progress, the router would send CoA-NAK packet back to the RADIUS server with incorrect code 122 (invalid request) wrongly, before sending CoA-ACK packet in response to the original CoA-Request that was being processed. In this case the router should ignore all RADIUS CoA-Request retries and respond only to the original CoA-Request packet. PR1198691
Incorrect service-accounting name in radius accounting record if service activated by SRC PR1206868
If RADIUS return Framed-route="0.0.0.0/0" to a subscriber terminated on Junos OS platform, this subscriber can not login due to authentication error. PR1208637
On MX Series routers with subscriber management feature enabled, after GRES switchover "show network-access aaa statistics radius" CLI command display only zeros and "clear network-access aaa statistics radius" doesn't clear statistics as it should. It's a cosmetic issue and communication with Radius server is working fine, the only impact is that affected CLI commands do not work as expected. PR1208735
If radius Primary-WINS(Juniper-ERX-VSA) is set as 0.0.0.0, subscribers is rejected by Authd and doesn't negotiate further. PR1209789
Commit error: "Radius-Flow-Tap LSRI "" is in use by subscriber, cannot be removed from the configuration" might be seen after two consecutive GRES switchovers if a subscriber with lawful intercept mirroring enabled was logged in before the switchovers. PR1210943
User Interface and Configuration
When entering the "restart r" incomplete command in the CLI, the command "restart routing" is executed. It should throw an error like "error: invalid daemon: r". PR1075746
Config database is locked by "root" user when trying to commit vpls circuit configs in "config exclusive" mode. PR1208390
If user enter configuration mode with "configure exclusive" command, after configuration is automatic rollback due to commit un-confirmed, user still can make configuration changes with "replace pattern" command, the subsequent commit fails with "error: access has been revoked". After exit configuration mode, user fail to enter configuration mode using "configure exclusive" with "error: configuration database modified". PR1210942
When persist-groups-inheritance is configured and you issue a rollback, it will be seen that the configuration is not propagated properly after a commit. PR1214743
VPNs
In a multi-homed source topology in NG-MVPN (applicable to both inter-AS and intra-AS scenario), there are two problems: The first problem is Multicast (S, G) signaling doesn't follow RPF. When the routing table (mvpninstancename.inet0) has two routes, due to the policy configuration, the best route to the source is via the MPLS core, but Multicast (S, G) PIM join and NG-MVPN Type 7 both point to inactive route via local BGP peer. The second problem is when "clear pim join instance NG" is entered, the multicast forwarding entries are wiped out. PR1099720
After a GRES with NSR enabled, in NG-MVPN scenario, on the new backup RE RPD is consuming more than 90% CPU. This issue happens rarely and it is not reproducible. PR1189623
In BGP VPLS environment, sometimes we receive routes from BGP with invalid next-hop related information. In such scenarios, VPLS should treat them as bad routes and not send them to rpd infra for route resolution. Due to a software defect, the bad routes are passed to the route resolver, which might lead to rpd process crash. The routing protocols are impacted and traffic disruption will be seen due to loss of routing information. PR1192963
With MVPN and NSR enabled, high CPU on backup Routing Engine might be seen. MVPN on backup Routing Engine is re-queuing c-mcast events for flows as it is unable to find phantom routes from master routing-engine. However as routes is not reaching from master Routing Engine so backup Routing Engine keeps trying causing high CPU triggered by rpd processing. PR1200867
In MVPN mode SPT-only, the first multicast packet is lost when the source is directly connected to the PE. PR1204425
Resolved Issues: 15.1R4
Class of Service (CoS)
When customers delete an IFL from an interface-set that has CoS applied to it and activate CoS profile directly on that IFL in one single commit, commit fails with an error. Commit goes through if they do it one by one, delete IFL from interface set, commit and then activate CoS on that IFL, commit. PR1169272
Forwarding and Sampling
Configuration statement "interface-mac-limit" might be set to default value when activating "mac-table-size" on a VPLS routing instance. Restarting l2ald, reapplying the "interface-mac-limit" or changing to another value (set interface ge-3/1/0.0 interface-mac-limit 510) fixes the issue. user@router> show vpls statistics | match count Current MAC count: 0 (Limit 1024) << set to default value 1024 instead of the value set by interface-mac-limit PR1025503
In some rare cases, SNMP might get Output bytes of Local statistics instead of the Traffic statistics when retrieving Output bytes of Traffic statistics on a logical interface. PR1083246
When using MX Series-only features (gre decapsulate or payload protocol in IPv6), a change of policers or counters to an existing firewall filter using physical-interface-filter or interface-specific configuration statements will not be correctly detected by MIB2D. PR1157043
Configuration container [protocols] [l2-learning] [global-mac-move] is made visible. The functionality under it are already supported but the command was hidden till now. PR1160708
Configuration is restricted to include uid variables in variable expressions Please find the following example as below root@R1# show dynamic-profiles SERVICE-PROFILE variables input-filter { mandatory; uid-reference; } input-bw mandatory; output-filter { mandatory; uid-reference; } output-addr1 mandatory; output-addr2 mandatory; fin1-uid uid; fout1-uid uid; fout2-uid uid; policer1-uid uid; prefix1-uid uid; term-var equals "ifNotZero ($output-addr1,'voice:'##$fout2-uid##':'##$fout1-uid)"; root@R1# commit error: syntax error in profile SERVICE-PROFILE variable term-var error: syntax error in variables stanza in profile SERVICE-PROFILE error: foreign file propagation (ffp) failed. PR1168994
This issue will be seen only when there are huge number of routes having different BGP NHs pointing to the same AS. Depending on the number of routes pointing to AS paths and also the difference in BGP NHs in the routes can shoot up the SRRD CPU consumption. In the real network this issue might not be seen often, as the number of AS paths will be huge and the routes referring these AS paths will be usually distributed among the AS paths. Even if the routes are pointing to the same AS, the impact would be lesser than the one seen in this PR. PR1170656
When polling SNMP counters for MX series-Only firewall filters, MIB2D_RTSLIB_READ_FAILURE cosmetic error messages might get reported in syslog. PR1173057
statistics-service daemon (pfed) experiences constant memory leak of 10 KB every 2 minutes when MobileNext package is installed: > show version Model: mx480 Junos: 14.1X55-D30.10 JUNOS Base OS boot [14.1X55-D30.10] <...> JUNOS MobileNext Routing Engine Software [14.1X55-D30.10] <<< this package. PR1174193
Even if packets do not match firewall filter conditions, wildcard mask firewall filter might match any packets. << Sample config >> ------------------------------------------------- set firewall family inet filter TEST-filter term TEST1 from destination-address 0.0.0.255/0.0.0.255 <<<<<< set firewall family inet filter TEST-filter term TEST1 then count TEST1 set firewall family inet filter TEST-filter term TEST1 then discard set firewall family inet filter TEST-filter term TEST2 then accept ------------------------------------------------- This is discard filter for /24 prefix broadcast address. However it might discard other packets. PR1175782
This is cosmetic issue. During sampling with jflow version 9, bfd packets from MPLS-TP were shown like as ip packets in "show services accounting aggregation template template-name XXX" command. (Actually, bfd packets info is not sampled by jflow.) << example >> ********************************************************************** lab@router-re0> show services accounting aggregation template template-name mpls Src Dst Port/ Port/ Top MPLS MPLS MPLS Source Destination ICMP ICMP Label Label 1 Label 2 Label 3 Address Address Type Code Proto TOS Address 299776 13 0 0.0.0.16 0.1.134.160 0 0 0 0 100.100.100.3 <<<<< bfd packet 299776 13 0 0.0.0.17 0.1.134.160 0 0 0 0 100.100.100.3 <<<<< bfd packet 299776 16 0 10.0.0.1 40.0.0.2 8 0 1 0 100.100.100.3 <<<<< ping 299792 16 0 40.0.0.2 10.0.0.1 0 0 1 0 100.100.100.1 <<<<< ping 299776 16 0 40.0.0.2 10.0.0.1 0 0 1 0 100.100.100.3 <<<<< ping ********************************************************************** << sample topology >> ********************************************************************** MPLS-TP(OAM, BFD) <--------------------> 10.0.0.1 40.0.0.2 sampling [CE1]-------[PE1]-------[DUT]-------[PE2]-------[PE2] | | [collector] ********************************************************************** PR1177876
In Junos OS Release 15.1 and later, family vpls filter applied to ae-interface is not working. PR1178743
SRRD daemon does not delete routes when the DELETE is received from RPD in few configuration cases. This results in build-up of memory in SRRD daemon and once SRRD reaches the limit, it crashes and restarts itself. This happens only when none of the SRRD clients (FPCs in Inline JFlow case and PICs in PIC based sampling) are interested in one or more families. Say, only IPv4 family is configured in all the clients and, IPv6 and MPLS families are not configured for Sampling in any of the clients. PR1180158
General Routing
An EVPN with support for inter-subnet routing using an irb interface may experience a crash and restart of rpd, leaving a core file for analysis. In this case, EVPN MAC routes contain MAC+IP, and this IP/32 is installed in Routing Instance table on egress router. Core is triggered in the IP/32 route installation flow. There is no special trigger point- it is a timing issue with basic irb configurations. PR992059
An inconsistency between JUNIPER-VPN-MIB and MPLS-L3VPN-STD-MIB with the number of interfaces for an routing-instance has been identified. For example with the following configuration: user@router-re0> show configuration routing-instances ri1 instance-type vrf; interface ge-2/0/8.10; interface lo0.10; route-distinguisher 65000:1; vrf-target target:65000:1; vrf-table-label; According to the MPLS-L3VPN-STD-MIB there are two interfaces in this routing-instance: MPLS-L3VPN-STD-MIB :: mplsL3VpnVrfAssociatedInterfaces: OID: 1.3.6.1.2.1.10.166.11.1.2.2.1.8 Description: Total number of interfaces connected to this VRF (independent of ifOperStatus type). {master} user@router-re0> show snmp mib walk 1.3.6.1.2.1.10.166.11.1.2.2.1.8 mplsL3VpnVrfAssociatedInterfaces.3.114.105.49 = 2 However according to JUNIPER-VPN-MIB there are three interfaces in this VRF: JUNIPER-VPN-MIB :: jnxVpnIfStatus OID: 1.3.6.1.4.1.2636.3.26.1.3.1.10 Description: Status of a monitored VPN interface. user@router-re0> show snmp mib walk 1.3.6.1.4.1.2636.3.26.1.3.1.10 jnxVpnIfStatus.2.3.114.105.49.733 = 5 jnxVpnIfStatus.2.3.114.105.49.754 = 5 jnxVpnIfStatus.2.3.114.105.49.774 = 5 The interfaces in the example are: {master} user@router-re0> show snmp mib walk 1.3.6.1.2.1.2.2.1.2 ifDescr.733 = ge-2/0/8.10 ifDescr.754 = lo0.10 ifDescr.774 = lsi.0 The fix for this issue adjusts this by removing the dynamic interface (in this case, lsi.0) from the interface list of JUNIPER-VPN-MIB. PR1011763
The L2ald may crash after interface flap. PR1015297
CoS scheduler names cannot be added or changed via service COA's. The schedulers can be added at subscriber login using client dynamic profiles. PR1015616
When ps interface is configured using as anchor interface, a logical tunnel (lt) interface without explicit tunnel-bandwidth configuration (under 'chassis fpc <fpc number> pic <pic number> tunnel-services' configuration hierarchy), the ps interface is created only in kernel, but not on Packet Forwarding Engine. In order to have ps interface in Packet Forwarding Engine, an explicit tunnel-bandwidth configuration is required. PR1042737 removes this restriction, and a ps interface may be anchored to an lt interface without explicit tunnel-bandwidth configured. PR1042737
IPV6 RA is not including source link address option on ps.x pseudowire interfaces. PR1049952
Wrong byte count was seen in the ipfix exported statistics packets for mpls flows. This issue is taken care now. PR1067084
There are some configuration related functions in rpd and l2cpd that use special Memory API called Lite Pools. These pools when reset were not freeing control information related to the pool and hence resulting in a leak. This is not a day one issue. This bug was introduced in 15.1 when we reimplemented LIBJTASK memory subsystem. This PR impacts all daemons using LIBJTASK (including rpd) on all platforms provided memory lite pools are used by those daemons. PR1071191
PCE-initiated LSPs are less preferred than locally configured LSPs. After this issue is fixed, PCE-initiated LSPs will have same preference as locally configured LSPs. PR1075559
The Enhanced LAG feature is enabled in network-service enhanced-ip mode, but it is not supported in enhanced-Ethernet mode. PR1087982
Certain VTY JNH commands (see description of this PR-1094955) on MX Series platforms will not decode properly, would need this PR fix. PR1094955
On MX Series routers where MS-MIC or MS-MPC is inserted, certain combinations of fragmented packets might lead to an MS-MIC or MS-MPC coredump. PR1102367
On MX Series platforms, in rare condition, if Packet Forwarding Engine sends wrong Packet Forwarding Engine id to chassisd as part of capability message, kernel might crash and some FPCs might be stuck in the present state, the traffic forwarding will be affected. This is a corner case, it is not reproduced consistently. PR1108532
Fixed problem with "egress pfe unspecified" increase when bind dhcp relay (or fpc restart caused ospf connection lose. Not able to ping its neighbor, arp table is fine, got egress Packet Forwarding Engine unspecified). PR1114132
ANCP is not supported in this release.Attempts to use ANCP related show commands will result in a timeout. PR1121322
With IPv6 access route configured in dynamic profile, when the router receives IPv6 SOLICIT message which request only Prefix Delegation but no IPv6 address, the access route will not be installed successfully. PR1126006
RPD crash might be seen during deletion of address family on an interface while rpf check is configured. PR1127856
The rpd might crash when local-switching is configured with connection-protection for L2Circuit. This problem only occurs after setting composite-next-hop for L2Circuit using set routing-options forwarding-table chained-composite-next-hop ingress l2ckt. PR1129940
When using Point-to-Point Tunneling Protocol (PPTP) Application Layer Gateways (ALG) on MS-MPC/MS-MIC, if running scaled number of PPTP sessions control and data sessions (e.g. 1M sessions) for long hours (e.g. more than 8 hours), when the traffic is stopped, the "Bytes used" field of the output of CLI command "show services service-sets summary" will show a randomly large value due to memory issue. PR1131605
In a situation where both mirrored interface and mirrored destination are on MPC card and mirror destination interface is a unilist next-hop(e.g. an ae interface), mirrored packets may get dropped. PR1134523
Kernel crash might be seen due to integer wrapping around in case of 64 bit architecture. PR1134578
Insufficient time to allow an MPC5/MPC6 card to lock on the clocking source during FPC boot time might cause the Major Alarm raised due to "PLL Error." PR1137577
MIC-3D-16CHE1-T1-CE only supports 4 queues by default due to the incorrect setting in code, this is a very minor change to make MIC-3D-16CHE1-T1-CE support 8 queues by default. PR1138270
After removing a child link from AE bundle, the AE interface statistics in the SNMP MIB might show a spike. PR1140533
When DHCP subscribers are brought up on the static interface IFL with interface-set, and this static interface IFL shares multiple DHCP stacks, it is possible that the interface-set does not get deleted when all DHCP subscriber are brought down on this static IFL. Unable to delete interface-set leads to commit denies on the dynamic profile involved. PR1145450
Twice-NAT translation type does not work with the MS-MPC and MS-MIC service cards. The older MS-DPC cards does support his translation type. PR1145690
With a 100G CFP2 MIC installed in a MPC6E FPC. If the FPC fails to initialize the MIC, it is very likely that the FPC will get into boot loop. PR1148325
Subscriber traffic in an LNS coming from the core network is not switched properly when the incoming interface is an irb interface. PR1148533
In EVPN environment, when CE MAC address alone gets changed for a MAC+IP entry, new MAC+IP entry is not getting reflected in EVPN database and the old entry still exists on PE router. PR1149340
During deactivation of interfaces in a scaling setup the Packet Forwarding Engine may reboot or Packet Forwarding Engine may notice next-hop corruption. PR1151844
From Junos OS release 14.2 with "exclude-hostname" configuration, hostname is not excluded from the messages before forwarding. This is a minor case, no other service impact. PR1152254
Routers using inline layer 2 services may experience Packet Forwarding Engine wedge leading to fabric degradation and FPC restart. During issue state, the affected FPC will not be able to transmit and traffic will be fully blackholed. This problem is amplified by fragmented and out of order packets. This log entry may be seen during the error state: Host Loopback:HOST LOOPBACK WEDGE DETECTED IN PATH ID 0. PR1153750
CE in an EVPN setup which has no-mac-learning or is otherwise forwarding traffic upstream to MX's in an Active/Active EVPN configuration will see split horizon broken by the MX PE which has the MAC as DRC status. PR1156187
After MIC "MIC-3D-4OC3OC12-1OC48" reboot, we might see below logs filling syslog message : router-re0 fpc2 cc_mic_sfp_is_present:????????????????????????????????????????????????????? ?????????????????????????^^??^P-sM-^T^S?? - Device is not SFP type router-re0 fpc2 cc_mic_sfp_periodic: Link 0 SFP - plugged in. router-re0 fpc2 cc_mic_sfp_is_present:????????????????????????????????????????????????????? ?????????????????????????^^??^P-sM-^T^S?? - Device is not SFP type [LOG: Err] cc_mic_sfp_is_present:????????????????????????????????????????????????????? ?????????????????????????5?x??l?8 - Device is not SFP type [LOG: Err] cc_mic_sfp_is_present:????????????????????????????????????????????????????? ?????????????????????????5?x??l?8 - Device is not SFP. PR1156353
"op 8 (COS Blob) failed" messages may be seen in syslog for vmx when we reboot the FPC. PR1156450
Given an active BGP multipath route with 2+ Indirect-Next-Hops and another BGP route which can participate in protocol independent multipath with router-next-hop, rpd might crash if the interface on which first member of Indirect-Next-Hop resolves goes down. PR1156811
On MX Series platforms supporting MPC3E or MPC4E type MPC, the single-hop BFD session configured under a routing-instance (RI) can flap intermittently. The problem would be seen when the main-instance loopback firewall filter discards/rejects the BFD packets OR has term to accept only BFD packets from neighbors configured under main instance. In both scenarios, the BFD session packets coming on routing-instance will be wrongly matched to main-instance loopback filter and gets discarded. With the fix of this issue, this situation is avoided and BFD session packets from routing-instance will be matched with the correct RI loopback filter (if configured). Note: In case there is no RI loopback interface configured, then BFD packets are matched against main-instance loopback filter. PR1157437
From Junos OS Release 13.2R1 and later, Packet Forwarding Engine interfaces on MX Series with MPCs/MICs-based line cards might remain down after performing "request system reboot both-routing-engines " or "restart chassisd" several times. Reboot the FPC might restore it. PR1157987
RPD may crash after EVPN was configured when extra bits in the ESI label extended community are set besides the single-active bit. PR1158195
On MX Series platforms, when MPC experiences a FATAL error, it gets reported to the chassisd daemon. Based on the action that is defined for a FATAL error, the chassisd will take subsequent action for the FATAL error. By default, the action for FATAL error is to reset the MPC. When the MPC reports FATAL error, chassisd will send offline message and will power off the MPC upon the ACK reception. However, if MPC is in busy state for any reason, the ACK doesn't come in time and hence there would be a delay in bringing down the MPC. The fix ensures to bring down the MPC in time upon FATAL error. PR1159742
In cases when the subscriber stacking is IPV6 over LNS, the IPV6 subscribers fails to come up with RPF check configured. DHC IPV6 subscriber over LNS comes up fine when RPF check configuration is disabled or removed. PR1160370
Software OS thread on the line card is doing a busy loop by reading the clock directly from hardware. Sometimes it seems the thread is getting wrong values from HW register and waiting forever in the busy loop. After the busy loop crosses a certain time period, the line card crashes and reboots. This is a rare condition. PR1160452
On MX Series routers with enhanced queuing DPCs, there is a memory leak whenever doing SNMP walk to any of COS related OID's or issue the command "show interfaces interface-set queue <interface set name>". PR1160642
The Router Lifetime field is set to 0 in the first Routing Advertisement sent from LNS back to PPPoE subscriber. PR1160821
The VCCPD_PROTOCOL_ADJDOWN system log message does not include a 'reason' string to explain why the virtual chassis adjacency was terminated. This information will now be present in the message. PR1161089
When FPC goes to terminated state (FPC down, restarts) ACI interface-set does not get deleted. After FPC becomes online further subscriber bring up on this ACI interface-set fails. PR1161810
Subscriber where TCP is attached to the underlying IFL will errantly end up in the control IFL queue. Workaround is to attach a TCP profile to each subscriber IFL. PR1162108
Interfaces routing status message xxx.xxx.xxx.xxx <Up Broadcast> may be reported on an interface that is not associated with the config change, such as bridge-domain addition. It should be reported only if there is any change in the IFL parameters. This is an info(6) level message for debug purpose, so we can safely ignore the cosmetic problem. rpd[xxx]: %DAEMON-6: EVENT Flags ge-1/0/4.0 index 371 10.180.230.8/24 -> 10.180.230.255 <Up Broadcast> rpd[xxx]: %DAEMON-6: EVENT Flags irb.110 index 326 10.9.17.254/22 -> 10.9.17.255 <Up Broadcast> rpd[xxx]: %DAEMON-6: EVENT Flags irb.190 index 373 10.9.53.254/22 -> 10.9.53.255 <Up Broadcast> PR1162699
MQCHIP reports continuous "FI Cell underflow at the state stage" message and continuous fabric drops on ADPC ICHIP Packet Forwarding Engines after ISSU on MX with ADPC. PR1163776
The ability to configure a multicast group statically for a subscriber via a dynamic profile is not available in this release. Using the following statement, the subscriber can be enabled to receive multicast traffic for group 224.117.71.1 upon login: set dynamic-profiles <client profile> protocols igmp interface "$junos-interface-name" static group 224.117.71.1 This support is not available and the subscriber needs to send a IGMP protocol JOIN message to receive multicast traffic. PR1164323
On Junos OS Release 15.1 and later, on MS-MPC or MS-PIC, OSPF adjacency may fail to establish when there is no static route pointing to service PIC. PR1164517
With IKEv1, MS-MPC packet drops on far-end after reboot of local MS-MPC. PR1165787
When MS-MPC is used, if any bridging domain related configuration exists (e.g. “family bridge”, “vlan-bridge”, “family evpn”, etc), in some cases, continuous MS-MPC crash hence traffic loss may occur. PR1169508
If a given demux VLAN hosts both dynamic IP demux subscribers as well as static IP demux interfaces, it is possible that the dynamic IP demux subscribers appear to bind successfully, but they can experience forwarding problems. In this scenario, the dynamic subscriber state is not fully established on the line card, resulting in traffic issues. PR1170019
Adding keyword 'fast-filter-lookup' to existing filters of an input or output filter list may result in failure to pass traffic. To avoid this issue, the filter list should first be deactivated then the filters updated with a the keyword 'fast-filter-lookup; then the filter list activated. PR1170286
If the "no-cell-share" configuration statement under the chassis stanza is activated on MPC3, MPC4, MPC5, or MPC6 cards, the Packet Forwarding Engine will only be able to forward about 62Gbps versus ~130Gbps and causing fabric queue drops. PR1170805
The fan speed logic does not operate correctly once PEM on MX104 platforms does automatically shutdown due to over temperature protection. The fan speed moves back to speed normal. It takes more time for PEM to cool down and come back online automatically with fan at normal speed. PR1174528
Storm control feature is not working on MX104 platforms. In Packet Forwarding Engine, associated filters and vty commands are not visible as well. It works on other MX Series platforms. PR1176575
Major alarm is raised although the HDD is properly mounted. This is cosmetic and can be safely ignored. lab@Router-re0> show system alarms 1 alarms currently active Alarm time Class Description 2016-06-03 12:34:32 JST Major Host 1 failed to mount /var off HDD, emergency /var created *** messages *** Jun 3 12:34:32.542 Router-re0 chassisd[xxxx]: CHASSISD_SNMP_TRAP7: SNMP trap generated: Failed to mount /var off HDD, emergency /var created (jnxFruContentsIndex 9, jnxFruL1Index 2, jnxFruL2Index 0, jnxFruL3Index 0, jnxFruName Routing Engine 1, jnxFruType 6, jnxFruSlot 1) Jun 3 12:34:32.543 Router-re0 alarmd[xxxx]: Alarm set: Routing Engine color=RED, class=CHASSIS, reason=Host 1 failed to mount /var off HDD, emergency /var created Jun 3 12:34:32.543 Router-re0 craftd[xxxx]: Major alarm set, Host 1 failed to mount /var off HDD, emergency /var created Jun 3 12:34:36.311 Router-re0 alarmd[xxxx]: Alarm cleared: Routing Engine color=YELLOW, class=CHASSIS, reason=Loss of communication with Backup Routing Engine Jun 3 12:34:36.311 Router-re0 craftd[xxxx]: Minor alarm cleared, Loss of communication with Backup Routing Engine lab@Router-re0> show chassis hardware detail Hardware inventory: Item Version Part number Serial number Description Chassis JN108EC81AEA M120 Midplane REV 01 710-016198 RB8325 M120 Midplane FPM Board REV 06 710-011407 DA9342 M120 FPM Board FPM Display REV 02 710-011405 DA9116 M120 FPM Display FPM CIP REV 05 710-011410 DA9002 M120 FPM CIP PEM 0 Rev 04 740-011936 000190 AC Power Entry Module Routing Engine 0 REV 07 740-014082 9009004158 RE-A-2000 ad0 999 MB SILICONSYSTEMS INC 1GB 168CTS79SD805DC30289 Compact Flash ad2 38154 MB FUJITSU MHV2040BS NW26T7525Y4M Hard Disk Routing Engine 1 REV 07 740-014082 9009003624 RE-A-2000 ad0 999 MB SILICONSYSTEMS INC 1GB 128CTS79Sz705DC20485 Compact Flash ad2 38154 MB FUJITSU MHV2040BS NW26T7826D39 Hard Disk <<<<<<<<<<<<<<< HDD is properly mounted CB 0 REV 07 710-011403 DB4442 M120 Control Board CB 1 REV 07 710-011403 DB4473 M120 Control Board. PR1177571
MACSEC not working on layer 3 interface on MX104. PR1177630
In a rare error scenario krt_q_entry of flow route was freed without dequeuing it from queue. This has been fixed via software change. PR1178633
In MX Series running a Junos OS Subscriber Management Build, with more than 300+ firewall filters configured, it was found that an subscriber failed to login due to NACK received from system, stating the following error: BBE_DFW_DYN_PROF_ERR_STR session_id=1784: Can't find filter template named test300. BBE_DFW_DYN_PROF_ERR_CODE session_id=1784: Error code 13: Filter template not found. While the firewall filter named "test300" was certainly configured under the firewall filter configuration stanza; it found that the BBE daemon could hold a count of 256 filters only. Filters above this count were not getting indexed into the internal filter table and hence system could not find the filter. PR1178671
In EVPN A/S mode, IFL mark down programming at the Packet Forwarding Engine on the BDF gets removed causing traffic loops. PR1179026
[EVPN] Active-Active IP4 L3 session with CE over IRB Flaps. PR1179105
When an MPC has training failure on all planes, then other MPCs in the system are getting affected. The root cause is that MQ MPC are not deleting the streams of the MPC which is causing the fabric wedge and effecting other MPCs. As a result FH is kicking in for other MPCs in the system. PR1183230
When IPv4 firewall filter have 2625/32 destination in prefix-list, filter attached to subscriber interface is found broken. PR1184543
Nexthop attribute in a framed route is not applicable anymore. Since subscriber IP address is used as the nexthop in all cases, there is no need to have an additional attribute for nexthop for framed routes. PR1186046
High Availability (HA) and Resiliency
With NSR enabled on multiple Routing Engine system, when dynamic GRE tunnel is configured, performing Routing Engine switchover might cause rpd crash repeatedly on backup Routing Engine. PR1130203
After graceful switchover is triggered in master VRRP router for the first time, the master state for all the VRRP instances are toggled to backup and comes back to master immediately. During this time all the traffic are dropped and comes back. PR1142227
MXVC: ISSU failed after all FPC upgraded, TCP connection to kernel was dropped due to invalid IPC type 20. PR1163807
Infrastructure
In scaling setup (in this case, there are 1000 VLANs, 1000 Bridge Domains, 120 IRB interfaces, 120 VRRP instances, BGP and IGP), if the routing protocols are deactivated and activated, there might be a chance that the pending route stats are not cleaned up, which will cause the stats infra to have stale pointers and lead to memory corruption in socket layers. The system might go to db prompt because of this. All the traffic going through the router will be dropped. PR1146720
In Legacy, if the Routing Engine switchover on RPD crash configuration statement is enabled, the switchover is instigated in the Routing Engine kernel BEFORE the RPD core is created. In FreeBSD 10.x-based Junos OS, it is done AFTER. This creates an issue in scaled setups where the size of the RPD core, and therefore the time to create it, takes a lot longer. An FreeBSD 10.x-based Junos OS FreeBSD change has been made as part of this PR patch to adopt the same behavior as Legacy. PR1164139
Support for fast Routing Engine mastership switchover when master Routing Engine crashes was broken in Occam releases up to and including 15.1R3. The patch for this PR addresses support for this feature in Occam in 15.1R4. PR1167385
Issue is specific to Occam based images and is a very hard to induce. The issue occurs when virtual memory is low, and the FreeBSD OS tries to free-up memory by invoking the vm_lowmem event. In a multi-core environment, multiple kernel threads could encounter the low virtual memory condition at the same time, and so the vm_lowmem event could be invoked within the context of multiple kernel threads concurrently. Some of the protocols in the Junos networking stack register handlers against this event and two of these, clnp_drain() & tcp_drain(), were not SMP safe, which caused data corruption. clnp_drain() & tcp_drain() have now been made SMP safe; all other such handlers in the Junos networking stack were already SMP safe. PR1182958
Interfaces and Chassis
Due to movement of SNMP stats model from synchronous requests to asynchronous requests in Junos OS Release 13.3R1, the IQ2/IQ2E PIC, which has limited memory and CPU power, can not handle scaling SNMP polling at high rate (e.g., a burst of 4800 SNMP requests). This issue comes with high rate SNMP stats polling for IQ2/IQ2E interfaces or Aggregated Ethernet (AE) interface with IQ2/IQ2E as member links. These memory failures can cause IQ2/IQ2E PIC reboot because keep alive messages will also not get memory. PR1136702
When we polling SNMP MIBs for IPv6 traffic, for example, jnxIpv6IfInOctets, the logical interface (IFL) on IQ2 or IQ2E PIC may occasionally report double statistics. PR1138493
%DAEMON-3-CHASSISD_I2C_WRITE_ERROR: i2cs_write_reg: write error for group 8 at address 0x49, offset 32 %DAEMON-3-CHASSISD_I2CS_READBACK_ERROR: Readback error from I2C slave for FPC 1 ([0x11, 0x42] -> 0x0) - The above errors represent transient communication issues between between system components. - In certain cases, these can be service impacting. - Enhancements have been made for better handling of such error conditions. PR1139920
On OAM maintenance domain intermediate Point (MIP), the connectivity fault management (CFM) will not be enabled on L2VPN interface if it is configured after L2VPN is up. PR1145001
During a VRRP configuration change involving IP address change and/or VRRP configuration change while retaining same group ID, a race condition might occur causing vrrpd crash. PR1145170
In affected releases, the following cosmetic alarms are seen after reseating the clocking cables: 2015-11-13 05:22:56 UTC Major CB 0 External-A LOS 2015-11-13 05:22:56 UTC Major CB 0 External-B LOS. PR1152035
Remove MX Series from sending LCD halt message. PR1153219
SONET interface on MIC-3D-1OC192-XFP does not count input error correctly. While hardware counts framing error, runts and giants but input error in 'show interface extensive' command reports runts and giant only. PR1154268
When the master Routing Engine in the Virtual Chassis master router (VC-Mm) runs with high CPU (e.g. 99% CPU utilization), after a global/local switchover, the new master Routing Engine might relinquish its mastership during high CPU conditions. But the Virtual Chassis protocol role is not changed properly after the kernel relinquishes the mastership, causing dual master Routing Engines on this member router. PR1156337
"monitor interface <if name>" will start ifmon process. In this time if telnet session to router is disconnected unconventionally, then ifmon process was not killed and it will take up 100% CPU utilization. The workaround is to terminate the stale ifmon process. PR1162521
Internal timing for bringing FPCs online is extended for MX 2020/2010 systems to accommodate longer initialization times for fabric and FPCs. PR1164147
CLI commit warning is replaced by syslog warning message when limited-ifl-scaling configuration statement is configured. Warning message text remains the same. PR1165357
jpppd core at SessionDatabase::getAttribute() from Ppp::LinkInterfaceMsOper::getLowerInterfaceType() PR1165543
If an interface configured with VRRP is removed from a routing-instance to global, or from global to a routing-instance, the Logical Interfaces of that interface will be deleted and recreated. In ideal case as the interface gets deleted VRRP should move to bringup state, when the interface is created again VRRP goes to previous state. After this VRRP should get VIP addition notification from kernel and update VRRP state and group id for VIP. However, in race conditions, VRRP might get VIP addition notification from kernel even before the interface creation event happens. If so VRRP will never be able to update proper VRRP state and group id. So the VIP will reply for the ARP with an incorrect MAC ending with "00" while the correct MAC should end with the groups id configured. PR1169808
DCD core :/src/junos/sbin/dcd/infra/lag-link-dist/lag_link_dist_db.c:2147 PR1175254
jpppd: RLIMIT_STACK & RLIMIT_SBSIZE messages are marked incorrectly at NOTICE level instead of at INFO level. PR1178895
pppoed denies PADO for legitimate user PPPoE trace logs will report "Dropping PADI due to Duplicate Client" but there will be no subscriber logged in with that MAC address PR1179931
Commit check may exit without providing correct error message and causing dcd exit. The only known scenario to trigger this issue is to configure a IPv6 host address with any other address on the same family. PR1180426
Layer 2 Features
In BGP-based VPLS scenarios, changing the configuration of a VPLS mesh group might cause rpd core. FPC reboot might also be seen during the rpd core. PR1123155
From Junos OS Release 13.2R1 and later, the rpd process might crash when adding/deleting Virtual private LAN service (VPLS) neighbors in a single commit. For example, a primary neighbor is changed to become the backup neighbor. PR1151497
On GRES switch of mastership of Routing Engine via "request chassis routing-engine master switch", the dot1xd daemon will crash multiple times when 128K Logical Interfaces are configured in the MX960 chassis PR1118475
On MX Series platforms, in DHCP subscriber management environment (the device is either used as local DHCP server or DHCP relay agent), if configuring the Aggregate Ethernet (AE) interface (e.g. change the "MTU" of AE) while there are subscribers on it, in race condition, the DHCP binding failure would occur on the AE. PR1139394
In some cases where DHCP client devices are not fully protocol compliant they may become stuck trying to Renew an address lease indefinitely. These devices exposed a defect in the DHCP Relay behavior when acting as a proxy for the Server where a protocol NAK to restart the client was not properly created. As a result Address resources could be locked on the Relay preventing their use until the offending client device was restarted. PR1153837
In Junos OS Release 15.1R3 with tomcat mode enabled, DHCP subscriber management with IRB interfaces is not reliable. It is possible that the DHCP bindings are unable to fully establish with IRB interfaces due to this reason. However, these bindings with same IRB interfaces should come up properly with tomcat disabled. PR1155502
MPLS
In MPLS environment, the master Routing Engine might crash due to Mbuffer allocation failure and this crash will trigger an Routing Engine switchover, as a result Backup Routing Engine will become active. The issue is unreproducible, and trigger condition is not clear. PR979448
During interoperation with CISCO device (e.g. CRS) belongs to different IGP area, if the P2MP LSP ping echo reply message from Cisco device is using interface address other than loopback/router-id as the source address, the reply message will be dropped on Junos OS device. With the fix, Junos OS device will accept the packets and print them as 'uncorrelated responses'. PR1117166
Due to some data structure changes of ipc messages in 64-bit RPD, some of 32-bit applications (e.g. lsping, lspmon) would not work normally when RPD is running in 64-bit mode. Depends on Junos OS version, some of CLI commands might not work as expected. PR1125266
While changing the label action for a static-label-switched-path from "stitch" to "pop", the routes added by stitch functionality is restored and there is no criteria for deleting the routes. Because of this, rpd crash might be seen. PR1127348
MPLS TED might not select random links to calculate the ERO when OSPF is overloaded. Instead, only one or two interfaces will be used for all the configured LSPs originating from the router. PR1147832
With RSVP refresh reduction feature enabled (using RSVP aggregate messages), when changing the configuration statement "no-load-balance-label-capability" to "load-balance-label-capability" on the egress router, the Entropy Label Capability (ELC) for the egress router would not being propagated towards the ingress. As a workaround, we can execute "clear rsvp session" on the ingress or wait until 3 refresh cycles (say 100s with default RSVP refresh config). PR1150624
Static MPLS LSP using VT interface as a outgoing interface would not come up PR1151737
LSPing returns 'routing instance does not exist' when used in vpls routing-instance under logical system. PR1159588
If container LSP name and the suffix together are more than 60 characters in length, rpd process might crash during extensive split merge conditions. Its always advisable to keep them less than 60 characters. The member lsp name is coined in the following manner: <container name>-<suffix name>-<member count>- The LSP name can have upto 64 characters. So after putting together the container name, suffix, member-count (could go up to 2 digits), and the 2 hyphens, it should not exceed 64. So container-name and suffix together should not exceed 60 characters. A commit check will be added to throw warning if the name is more than supported character long. PR1160093
When L2VPN composite next hop configuration statement is enabled along with L2VPN control-word, end-to-end communication fails. Because in this scenario, control-word is not inserted by the ingress PE, but other end expects the control-word. PR1164584
Changing maximum-labels configuration under the hierarchy [edit interfaces interface-name unit logical-unit-number family mpls] might cause existing MPLS LSPs to become unusable. The root cause of this issue is that the family MPLS gets deleted and re-added. PR1166470
In LDP-signaled VPLS environment, other vendor sends an Address Withdraw Message with FEC TLV but without MAC list TLV. The LDP expected that Address Withdraw Message with FEC TLV should always have MAC list TLV. As such, it rejected the message and close the LDP session. The following message can be seen when this issue occurs: A@lab> show log messages |match TLV RPD_LDP_SESSIONDOWN: LDP session xxx.xxx.xxx.xxx is down, reason: received bad TLV PR1168849
In MVPN scenario, if active primary path goes down, then PLR(Point of Local Repair) needs to send Label Withdraw for old path and new Label Mapping for new path to the new upstream neighbor. In this case, LDP P2MP path may stay in "Inactive" state for indefinite time if an LSR receives a Label Release, immediately followed by a Label Mapping for the same P2MP LSP from the downstream neighbor. PR1170847
The rpd might crash upon receiving a TLE (Tag Label Element) delete notification arriving during a cleanup sequence. When adaptive teardown is configured and TLE delete notification comes during a cleanup sequence, this will trigger a recursive clean up and since the same cleanup routines are called and them being non-reentrant causes the code to assert. PR1172567
When the egress LSR withdraws the label for its egress route, the rlfa nexthop for the ldp route for the egress remains in other routers running rlfs. A routing loop is formed when the rlfa nexthops for some of the router are pointing towards each other. Any traffic for the label route would loop until TTL expires. After the fix,rlfa nexthop with nexthop label alone will not be considered as valid lsp nexthop (primary nexthop). ldp will send label withdraw for the label binding and delete the ldp route to avoid any potential routing loop. PR1172581
Network Management and Monitoring
Eventd might run out of memory and crash because of excessive kernel logging. PR1162722
Platform and Infrastructure
With "chassis maximum-ecmp 64" configured, when there is a route having 64 ECMP LSP next-hops and CoS-based forwarding (CBF) is enabled with 8 forwarding class (64*8=512 next-hops), not all next-hops will be installed on Packet Forwarding Engine due to crossing the boundary in the kernel when number of ECMP next-hops is large than 309. PR917732
When the 'enhanced-hash-key services-loadbalancing' feature is used by MX Series with MPCs/MICs based line cards, load balancing of flows across multiple service PICs via the source-address across does not work when internal BGP (IBGP) is used to steer traffic to the inside service-interface. For example the operator will see on the stateful firewall that the same source-address has flows across multiple service interfaces. PR1034770
On MX Series-based platforms, when learning the MAC address from the pseudo-IFL (for example, label-switched interface), if the MAC address is aged out in the source FPC where the MAC got learned, due to the delay (around 2 to 3 milliseconds) of MAC address deleting message processed in the source FPC and the egress FPC (destination FPC of the traffic), the MAC address might be deleted first from the egress Packet Forwarding Engine but get added again during these 2-3 milliseconds time intervals (As there is continuous traffic coming on the egress FPC destined to this MAC, the MAC query is generated and sent to the Routing Engine and source FPC. Since the source FPC has not yet processed the MAC-deleted message, it sends the response, so stale MAC will get added on the egress Packet Forwarding Engine). In this situation, no L2 flooding would occur for the "unknown" unicast (since the MAC address is present on the egress Packet Forwarding Engine). PR1081881
In certain cases, with some events such as disable/enable of links followed by Routing Engine rebooting or GRES enabled switch-over, below error message could be seen due to a software bug where it doesn't handle an internal flag properly. KERNEL/Packet Forwarding Engine APP=NH OUT OF SYNC: error code 1 REASON: invalid NH add received for an already existing nh ERROR-SPECIFIC INFO: PR1107170
Configuring one group with configuration of routing-instances and applying this group under routing-instances, then the rpd process will crash after executing "deactivating/activating routing-instances" commands. As a workaround, you can avoid using "apply-groups" under routing-instances hierarchy. PR1109924
On MX Series with MPCs/MICs based linecard platform, if FPC offline is performed while FPC is in online progress (online process is at the stage of fabric links training), in very corner scenario, the Routing Engines state is stale and being sent to other existing FPCs, so the traffic forwarding might be affected. PR1130440
Doing a file copy from a Routing-Engine running Junos OS image to a Routing-Engine running Junos OS with Upgraded FreeBSD image fails. PR1132682
When there are additional messages related to FIPS generated during <commit configuration> rpc reply, the xml-tags closing tag <routing engine> may be missed in the reply. PR1141911
FPC can crash and core due to a missing NULL check PR1144381
During an ISSU upgrade in MXVC environment, linecards may crash causing service impact. When the linecards come up, there may be a nexthop programming issue as a secondary impact and some Logical Interfaces may not pass traffic. Affected linecards need to be rebooted to recover from this condition. PR1152048
With Enhanced LAG mode enabled and sampling configured on AE interfaces, MS-DPC might drop all traffic as "regular discard". Disabling Enhanced LAG mode would avoid this issue. PR1154394
On MX2000 Series platforms, when MPC goes down ungracefully, other MPCs in the chassis will experience "destination timeout". In this situation, auto fabric-healing will get triggered due to "destination timeout" condition, which may cause Fabric-Plane reset, even all other MPCs to be restarted in some cases. PR1156069
cosd[20362]: cosd_config_database: Configuration database(/var/run/db/juniper-prop.data) does not exist. cosd[20460]: cosd_config_database: Configuration database(/var/run/db/juniper-prop.data) does not exist. The above log messages may be seen after after some commits. These messages do not pose an operational impact. PR1158127
If one logging user is a remote TACACS/RADIUS user, this remote user will be mapped to a local user on device. For permissions authorization of flow-tap operations, when they are set on the local device without setting the permissions on the remote server, they cannot work correctly. The flow-tap operations are as follow: flow-tap -- Can view flow-tap configuration flow-tap-control -- Can modify flow-tap configuration flow-tap-operation -- Can tap flows PR1159832
LU(or XL) and XM chip based linecard might go to wedge condition after receiving corrupted packets, and this might cause linecard rebooting. PR1160079
NPC cored vpanic in trinity_firewall_start_nh_get,trinity_firewall_add_and_check_internal,trini ty_firewall_add_and_check. This line card core could potentially occur after an ISSU upgrade. PR1160748
The following commit warning may be seen when using configure private and multi-line comments. This causes the commit to not complete. warning: outgoing comment does not match patch PR1161566
Due to software bug on chassisd, backup CB temperature information is missing on cli command 'show chassis environment cb' if it's replaced once. PR1163537
For MX Series Virtual Chassis with default-address-selection configured, when we have a discard route to a specific subnet ( for example, 10.0.0.0/8 ) with discard next-hop, and at the same time we have more specific routes through other interfaces ( for example, 10.1.1.1 through xe-0/0/0 ), if a UDP packet is being sent to 10.1.1.1 through xe-0/0/0 while interface xe-0/0/0 flaps or FPC reboots, it might cause kernel crash on both Master Routing Engine in the Virtual Chassis master router (VC-Mm) and Master Routing Engine in Virtual Chassis backup router (VC-Bm). As a workaround, we can disable default-address-selection configuration. PR1163706
Below log can be seen on MX2020 after One FPC was pulled out and committing the configration related interface. CHASSISD_UNSUPPORTED_FPC: FPC with I2C ID of 0x0 is not supported PR1164512
A sonet interface configured as unnumbered BFD session fails to come up. PR1165720
Modifying the configuration of a hierarchical policer when in use by more than 4000 subscribers on an FPC can cause the FPC to core and restart. PR1166123
There are three issues related to DDOS reported in the PR 1168425. 1) Some policers are configurable, but do no react when disabling them (tunnel-ka aggregate, re-services-v6 capti..v6, syslog aggregate) With the fix all the configurable DDOS protocol parameter changes will get reflected correctly in Packet Forwarding Engine. 2) Some policers for non-unclassified traffic are non-configurable (control aggregate, mcast-snoop mld, ipsec aggregate, uncls resolve-v4, uncls resolve-v6, uncls filter-v4, uncls filter-v6, tunnel-ka aggregate). These policers are internally deprecated or renamed and not shown on CLI anymore. So any configuration will not come to the Packet Forwarding Engine sides. 3) Some policers are for unclassified traffic are non-zero (mlp unclass, services unclass, radius unclass, ip-frag unclass, gre unclass, re-services unclass, re-services-v6 unclass) We do not have a convention of setting unclassified to 0. Consider this as FAD. PR1168425
In Junos OS Release 15.1, a customized password prompt that can be sent by a TACACS+ server is not displayed to the user upon login. A usual password prompt "Password: " is displayed instead. The issue is seen when the following conditions are met: 1. Junos OS Release 15.1 without the fix for this PR is used. 2. TACACS+ is used for the user authentication 3. When user logs in, TACACS+ server sends a customized password prompt for this user. For example, this can cause an issue when S/KEY-based one-time password (OTP) authentication is configured for a particular user on the TACACS+ server because the user might be unable to calculate the one-time password as they would not see the key sequence number and the seed provided by the authentication server. PR1168634
Because the sequence number in RPM ICMP-PING probes is introduced as 32-bit variable instead of 16-bit, if it increases and reaches the max value 65535, it does not rollover, which might cause all RPM ICMP-PING probes to fail and not succeed any more. PR1168874
In affected release, if user runs the Packet Forwarding Engine debug command like "show sample-rr eg-table ipv4 entry ifl-index 1224 gateway 113.197.15.66" will cause the MPC crash. PR1169370
Long container elements can have keys which could be very big in size. If the key is more than 256, max key length in Patricia tree, mustd is coring, which leads router into amnesiac mode and any login is denied. PR1169516
Layer 2 protocols might flap when router was flooded with low priority traffic reaching towards FPC CPU/Routing Engine CPU when DDoS protection is disabled. PR1172409
On MPC5E, MPC6E, MPC7E, MPC8E, MPC9E, and MPCNG linecards, firewall filter of family inet/inet6/vpls configured with non-contiguous prefixes for address matching might fail and cause traffic drop. Using only contiguous prefixes can avoid this issue. PR1172725
On all Junos OS platforms, when using RADIUS server, after RADIUS request is successfully sent by Junos device, if the network goes down suddenly, then response sent by the RADIUS server is not received within timeout period. In this scenario, the RADIUS request will be sent again with invalid socket descriptor, which will lead to auditd (provides an intermediary for sending audit records to RADIUS and/or TACACS+ servers) crash. PR1173018
"show arp" command can't get complete results and reports "error: could not find interface entry for given index". PR1174150
On MX2010/2020, MPC/SFB cards do not boot up if single phase AC PSMs are turned ON sequentially with interval even though the PSMs have sufficient remaining power. PR1176533
A flow is determined by doing hashing on the packet header. Usually 5-tuple (src/dest IP addresses, IP protocol number, src/dest ports) are used for hashing because a flow is defined by 5-tuple. This is all fine for TCP and UDP packets. But layer-3 packets generated by JDSU tester only have layer-3 header and do not have layer-4 header. JDSU tester uses the same location as layer-4 header as packets' sequence number. So MX Series with MPCs/MICs card treats sequence number of JDSU tester packets as layer-4 header of a packet, hence, Junos OS thinks every packet is a single flow and order of different flows are not guaranteed. PR1177418
When IPv6 route points to aggregated Ethernet bundle, J-Flow record shows outgoing interface as child interface and not actual aggregated Ethernet interface. PR1177790
Routing Policy and Firewall Filters
Interface-routes rib-group import-policy is not in effect to filter prefixes correctly. All direct prefixes could be installed into the secondary route table. PR1171451
Routing Protocols
When configuring router in RR mode (cluster-id or option B MP-eBGP peering), the advertise-external feature will not be applicable in local VRFs due to a different route selection/advertisement process (main bgp.l3vpn.0 vs VRF.inet.0). PR1023693
BFD session configured with authentication of algorithm keyed-sha1 and keyed-md5 might be flapping occasionally due to FPC internal clock skew. PR1113744
During many types of configuration changes, especially including import policy, BGP has the need to re-evaluate the routes it has learned from peers impacted by the configuration change. This re-evaluation involves re-running import policy to see if there is any changes to the learned routes after applying the new policy. This work is done in the background as part of an "Import Evaluation" job. When BGP is reconfigured a second time, and the "Import Evaluation job" has not completed, it is necessary to re-run the job from the beginning if there's another change to policy or something with similar impact. This state is noted as "Import Evaluation Pending". However, in this case, there was a bug that caused BGP to always enter the pending state upon reconfiguration, regardless of whether relevant changes were made to import or other similarly impactful configuration. The result is that once it is necessary to start re-evaluation of the routes for a peer, even trivial configuration changes that happen too quickly will cause the "Import Evaluation job" to need to run again as a result of the "Pending" flag being set. To avoid the issue, please ensuring that "ImportEval" is not present in a BGP peer's Flags output from the CLI show bgp neighbor prior to doing even trivial commits. PR1120190
In multicast environment, when the RP is first hop router (FHR) and it has MSDP peers, when the rpf interface on RP changed to MSDP facing interface, because the multicast traffic is still on the old rpf interface, a multicast discard route is installed and traffic loss is seen. PR1130238
In a situation which BGP is being used in combination with interface's rfp-check; deleted routes may see delay in propagation of BGP withdrawn messages. PR1135223
When interface IP MTU is less than 1464 bytes and the number of LSA headers in an OSPF DbD packet is big enough for it to exceed the MTU (i.e. OSPF database contains enough LSAs), unexpected fragmentation of OSPF DbD packets may occur due to incorrect calculation of maximum allowed payload size. PR1148526
In BGP scenario with large scale routing-instances and BGP peers configured, due to a software defect ( a long thread issue ), BGP slow convergence might be seen. For example, BGP might go down 8-9 seconds after BFD brings down the EBGP session. The rpd slip usually does not hurt anything functionally, but if the slip gets big enough, it could eventually cause tasks to not be done in time. For example, BGP keepalives with lower than 90 seconds hold-time might be impacted. There is no known workaround for this issue, but configuring configuration statement "protocol bgp precision-timers" can take care of the weak spot like sending BGP Keepalives. PR1157655
Starting from Junos OS Release 15.1R1 to Junos OS Release 15.1R3, and Junos OS Release 15.1F2 to Junos OS Release 15.1F4, Junos OS devices may not be able to establish BGP sessions with legacy router that does not support BGP optional parameters. The reason is that capability of supporting BGP open message fallback to no optional parameter is removed in these releases, which causes "OPEN Message Error (2)" during session setup. PR1163245
In BGP scenario with independent domain enabled in a VRF, when configuring a BGP session in a VRF routing instance with a wrong local-as number, some routes might be declared as hidden because of AS path loop. If later configuring the correct AS number as local-as and committing the configuration, those routes might still remain in hidden state. The hidden routes can be released after performing commands "commit full" or "clear bgp table <ANY_VRF>.net.0". PR1165301
In L3VPN scenario, feature multipath is configured under [set protocols bgp group] with L3VPN chained CNH under routing-options, the feature multipath does not work for L3VPN routes. PR1169289
When clearing IS-IS database, process rpd might crash due to a rare memory de-allocation failure that a task pointer is attempted to be freed twice. In the fix of this issue, the order of referencing the task pointer is being revised to avoid the occurrence of rpd crash. PR1169903
PIM bootstrap export policy is not working as expected when there are no pim neighbors up on the router PR1173607
Services Applications
When making a configuration change to a EXP type rewrite-rule applied to a SONET interface in an MX FPC Type 2 or MX FPC Type 3, if MS-DPC is also installed on the device, a MS-PIC core dump may be generated. PR1137941
In a rare situation in a SIP conversation we might end up in a situation where we have a child conversation whose entry in still present in in the parent conversation while the child flow is already deleted. While trying to delete this child flow from the parent conversation validate if the flow is valid and go ahead with deleting the child flow. PR1140496
When deleting NAT flow under a race condition the Service PIC can core PR1159028
These log messages no longer appear in syslog if log level is set to warning / error or higher. If the log level is set to notice or lower ( info / debug ) then these log messages are shown in syslog file. PR1162116
In Layer 2 Tunneling Protocol (L2TP) subscriber management environment, the jl2tpd process (L2TP daemon) might crash during clean-up of L2TP tunnel or session after it failed to establish. PR1162445
When traffic is flowing through MS-DPC card Service PIC and there is an active port block and some ports are assigned from that active port block, if changing the max-blocks-per-address setting to a lower value (lower than the current value), the service line card may crash. PR1169314
MS-PIC core-dump when MPLS or IPV6 routing updates are received. This is a race condition rarely seen while IPV6 or MPLS routes are deleted or added in the MS-PIC. PR1170869
Attempting to ping a subscriber address from the L2TP LNS CLI will fail. PR1187449
Subscriber Access Management
The range for the request-rate statement at the [edit access radius-options] hierarchy level has been extended to 100 through 4000 requests per second. In earlier releases, the range is 500 through 4000 requests per second. The default value is unchanged at 500 requests per second. PR1033668
If a DHCP local pool is exhausted, the newly dialed in subscriber B might get the IP address of newly logged out subscriber A, in a very rare condition, if the acc-stop message for A is sent to Radius server after acct-start for B, and if the Radius server identify the subscribers only by IP address but not by session, the subscriber B might get terminated. PR1079674
In DHCP relay scenario, DHCP relay binding might get stuck in "RELEASE(RELAY_STATE_WAIT_AUTH_REQ_RELEASE" state due to the LOGOUT Request is not processed correctly by authentication manager process (authd) if there were multiple attempts to activate Lawful Intercept (LI) for this DHCP subscriber using RADIUS change of authorization (CoA) packets in quick succession. PR1179199
User Interface and Configuration
From Junos OS Release 13.2R1 and later, the commitd process might crash while committing large configurations in a single commit, for example, committing 250k lines of config on top of existing config. This issue is due to a lack of storage space for current and running configurations. PR1159462
VPNs
Upon clearing p2mp lsp in dual-home topology, system is adding the same outgoing interface to the (S,G)OIL multiple times and thus duplicate/multiply the amount outgoing traffic. PR1147947
Resolved Issues: 15.1R3
Class of Service (CoS)
The chassis-scheduler-map is not applied to interface if FPC restart, Routing Engine switchover, or reboot. Only after deactivation/activation of the affected interface does the CoS get applied again. PR1132983
When the system has "system services subscriber-management enable" set (means the subscribers are VBF flow based), the ICMP MTU exceed notification may not be sent to subscribers, which will cause the subscriber Path MTU Discovery to fail. PR1138131
This PR does optimization in AE SNMP handling. If all the links in an AE bundle go down, then any CoS SNMP query for this AE IFD/IFL will return cached values. PR1140440
On the MX104 platform, when applying the "rate-limit" and the "buffer-size" on the logical tunnel (lt-) interface on the missing MIC (not inserted on MPC), commit failure with error message would occur. As a workaround, this issue could be avoided by applying the "rate-limit and "buffer-size" on inserted MIC, then commit. PR1142182
Forwarding and Sampling
The command "clear firewall all" will now clear the policer stats displayed by "show policer __auto_policer_template_1__", ... "show policer __auto_policer_template_8__". PR1072305
This issue is seen in Junos OS Release 14.2 and later releases. When Routing Engine based sampling is enabled and BGP session is using 4 byte AS, improper AS number can be found in sampling information. [router1]--------[DUT]--------[router2] AS 1,000 A AS 10,0000 | sampling 1.1.1.1 ---------------------->2.2.2.2 traffic --- traceoptions log --- Aug 10 12:21:21 v5 flow entry Aug 10 12:21:21 Src addr: 1.1.1.1 Aug 10 12:21:21 Dst addr: 2.2.2.2 Aug 10 12:21:21 Nhop addr: 20.20.20.1 Aug 10 12:21:21 Input interface: 747 Aug 10 12:21:21 Output interface: 749 Aug 10 12:21:21 Pkts in flow: 594 Aug 10 12:21:21 Bytes in flow: 49896 Aug 10 12:21:21 Start time of flow: 4648545 Aug 10 12:21:21 End time of flow: 4707547 Aug 10 12:21:21 Src port: 0 Aug 10 12:21:21 Dst port: 2048 Aug 10 12:21:21 TCP flags: 0x0 Aug 10 12:21:21 IP proto num: 1 Aug 10 12:21:21 TOS: 0x0 Aug 10 12:21:21 Src AS: 1000 Aug 10 12:21:21 Dst AS: 34464 <<<<< Aug 10 12:21:21 Src netmask len: 32 Aug 10 12:21:21 Dst netmask len: 32. PR1111731
On the MX Series platform with MX-FPC/DPC, M7/10i with Enhance-FEB, M120, M320 with E3-FPC, when there are large sized IPv6 firewall filters(for example, use prefix lists with 64k prefixes each) enabled, commit/commit check would fail and the dfwd process would crash after configuration commit/commit check. There is no operational impact. PR1120633
On all Junos OS platforms, when both the filter and the policer are configured for an interface, in rare cases, the policer template may not be received by the Packet Forwarding Engine (from the Routing Engine) when it is referenced by the filter term (normally the policer template gets received before the filter term referencing it which is ensured by mechanism in the Routing Engine kernel). In this situation, the FPC would crash due to this rare timing issue. This issue might be avoided by the recommended steps below: 1. Deactivate the physical interface (IFD) and commit 2. Enable any filter and policer that attached to the interface (e.g. IFL) and commit 3. Activate interface back. PR1128518
On MX80 and MX104 platform, applying firewall filter with MX Series specific match condition will raise the following warning message. Filter <filter_name> is MX Series specific; will not get installed on DPCs for interface <interface_name>. This warning message is needed for the other modular type MX Series platforms since it can have DPC and MPC mixed. But the message is not needed for MX80 and MX104 platform since they only have the MX Series based Packet Forwarding Engine. Although the warning message tells that the relevant firewall filter is not installed, the firewall filter is correctly installed into Packet Forwarding Engine. Thus, user can ignore the message in case the warning message is logged on MX80 and MX104 platform. PR1138220
For Junos OS release 14.1R1 and later, when a broadcast packet is sent in a scenario of Integrated routing and bridging (IRB) over Virtual Tunnel End Point (VTEP) over IRB, the packet is getting dropped in kernel as it was looping due to a software issue. The error log message "if_pfe_vtep_ttp_output: if_pfe_ttp_output failed with error 50" is observed when issue occurs. PR1145358
On MX Series-based platforms, in race condition, when using the policer which has configuration statement "bandwidth-percent" configured (e.g., set firewall policer XXX if-exceeding bandwidth-percent 80), if the logical interface (IFL) bandwidth change and the filter bind message arrive at the Packet Forwarding Engine out of order (e.g., when changing the bandwidth of the IFL or rebooting the FPC), the "bandwidth-percent" policer may end up using physical interface (IFD) bandwidth for "bandwidth-percent" computation. PR1154034
General Routing
On an MX Series Virtual Chassis platform, when we restart one or both of the standby Routing Engines, the log message "ksyncd_select_control_plane_proto: rhost_sysctlbyname_get: No such file or directory" might be observed as the ksyncd daemon attempts to select a communication protocol (UDP/TCP). After several tries, it will fall back to TCP and proceed as normal. PR945925
In an MX Series Virtual Chassis (MX-VC) environment, the private local nexthops and routes pointing to private local next hops are sent to the Packet Forwarding Engine from the master Routing Engine and not sent to the slave Routing Engine, then a Routing Engine switchover happens. Now as the new master Routing Engine does not know about such next hops and routes, they are not cleaned up. When a next hop with same index is added on the new master Routing Engine and sent to the Packet Forwarding Engine, the Packet Forwarding Engine might crash due to a stale next hop exist. PR951420
In a Layer 3 wholesale configuration, DHCPv6 advertise messages might be sent out with source MAC all zeroes if the subscriber is terminated on the demux interface in a non-default routing instance. For subscribers on default instance there is no such issue observed. PR972603
On MX Series routers with MPC3E, MPC4E, MPC5E, and MPC6E, Junos OS does not support short(sub-second) interface hold-time down configuration. So, a hidden configuration statement is introduced to ignore DFE tuning state during hold-down timer period. This configuration statement allows sub-second hold-down timer on MPC3E,MPC4E,MPC5E,MPC6E. set interfaces <intf name> hold-time up <U ms> down <D ms> alternative The configuration statement does not work/support 'MPC5E 3D Q 2CGE+4XGE' and 'MIC6 2X100GE CFP2 OTN', and we recommend configuring hold-time down to be more than 3 seconds for these two cards. PR1012365
On MX240/480/960/2010/2020 platform with Junos OS release 15.1R1 and later, the process health monitor process (pmond) is not available on the Routing Engine. The msppmond process on MS-MIC/MS-MPC tries to connect pmond process on Routing Engine continuously but fails. It will result in additional traffic between the MS-MIC/MS-MPC and Routing Engine, causing high CPU utilization. PR1014584
No performance or functional impact. Can be safely ignored. "Ignore the PTP message (2) as this MPC doesn't support EEC" should be moved from notice to debug level. PR1020161
MIC-3D-8OC3-2OC12-ATM Revision 22 or later is supported only by the following Junos OS releases: Junos OS Release 12.3 — 12.3R9 and later, Junos OS Release 13.3 — 13.3R6 and later, Junos OS Release 14.1 — 14.1R4 and later, Junos OS Release 14.2 — 14.2R3 and later, Junos OS Release 15.1 and later. PR1036071
There is a remote loop back feature in 802.3ah standard, where one end can put remote end into remote-loopback mode by sending enable loopback control lfm PDU. In remote loopback, all incoming packets (except lfm packets) are sent back on wire as it is. Transmit or receive of lfm packets should not be affected when an interface is in remote loopback mode. On the VMX platform when we configure the lfm remote-loopback we run into problem state, In problem state we will see that LFM packets sent from node which is in loopback state is not reaching the peer end hence we will not see the remote entity information for the "run show oam ethernet link-fault-management" command on peer router. PR1046423
On all routing platforms M Series, MX Series, T Series with BGP configured to carry flow-specification route, in case of deleting a filter term and policer, then add the same term and policer back (it usually happens in race condition when adding/deleting/adding the flow routes), since confirmation from dfwd for the deleting policer might not be received before attempting to add the same policer, the rpd would skip sending an add operation for it to dfwd. As a result, when the filter term is sent to dfwd and tell it to attach to the policer, dfwd had already deleted the policer, and since rpd skipped re-adding it, dfwd will reject the attach filter with policer not found error and rpd will crash correspondingly. PR1052887
When a labeled BGP route resolves over a route with MPLS label (e.g. LDP/RSVP routes), after clearing the LDP/RSVP routes, in the short window before the LDP/RSVP routes restore, if the BGP routes resolves over a direct route (e.g. a one-hop LSP), the rpd process might crash. PR1063796
When "satop-options" is configured on an E1 with Structure-Agnostic TDM over Packet (SAToP) encapsulation, after Automatic Protection Switching (APS) switchover, some SAToP E1s on the previously protect interface (now working) start showing drops. PR1066100
Upon BFD flapping on aggregate interfaces, the Lookup chip (XL) might send illegal packets to the center chip (XMCHIP) and compromise packet forwarding and an FPC restart is needed to recover from this condition. If Fabric path side is affected, the fabric healing process will initiate this process automatically to recover from such conditions. MPC6E/MPC5E/NG-MPC are exposed to this problem. Corrupted parcels from Lookup chip LU/XL to Center Chip (XM) can also compromise packet forwarding and report DRD parcel timeout errors. An additional parcel verification check is added to prevent sending corrupted parcels to the center chip (XM). PR1067234
ICMP echo_reply traffic with applications like IPsec will not work with the MS-MIC and MS-MPC cards in a asymmetric traffic environment since these cards employ a stateful firewall by default. The packet will be dropped at the Stateful Firewall since it sees an ICMP Reply that has not matching session. PR1072180
When flag is specified under ipsec-vpn traceoptions to trace IPsec operations, no message is logged to the specified trace file as expected. The issue impacts on debug capability only. PR1073705
On MX Series platform with MS-MPC/MS-MIC, when Network Address Translation (NAT), Stateful Firewall (SFW), Traffic Detection Function (TDF), or IPsec service is configured and traffic flows, an ordered packet might miss the descriptor due to the software defect. It results in prolonged flow-control, all data and control path are blocked, the service PIC goes down and not come up. PR1079745
Scheduler: Protect: Parity error for tick table single messages might appear on MPC3E/MPC4E/MPC5E/MPC6E/T4000-FPC5. PR1083959
In a two members MX Series Virtual Chassis (MXVC) environment, when "set virtual-chassis no-split-detection" is configured, if split master condition happens, which is caused by split events (i.e. loss of all adjacencies by link failure, FPC restarts, chassis power-down, Routing Engine reboots, etc), then once the VCP adjacency is formed again, the current design could not determine best chassis to win the protocol mastership election properly, instead, only the final election step (that is,choose the member device with the lowest MAC address) is used to elect the master device (protocol master of the VC, or VC-M). PR1090388
Scuba MPC6E Temperature Intake shows as "Testing" in "show chassis environment", but "show chassis environment fpc" and "show chassis fpc detail" are OK and provide the correct Temp information. > show chassis hardware | match fpc FPC 0 REV 66 750-044130 ABDA3551 MPC6E 3D FPC 9 REV 31 750-031087 CADR7177 MPC Type 1 3D FPC 10 REV 66 750-044130 ABCZ2741 MPC6E 3D {master} > show chassis environment | match "intake |state" | match fpc FPC 0 Intake Testing <<<<<<<< Wrong info FPC 9 Intake OK 37 degrees C / 98 degrees F FPC 10 Intake Testing <<<<<<<< Wrong info {master} > show chassis environment fpc | match "fpc|intake|state" FPC 0 status: State Online <<<<<<<<<<<<<<<<<< Correct info Temperature Intake 36 degrees C / 96 degrees F FPC 9 status: State Online Temperature Intake 37 degrees C / 98 degrees F FPC 10 status: State Online <<<<<<<<<<<<<<<<<< Correct info Temperature Intake 42 degrees C / 107 degrees F {master} > show chassis fpc detail Slot 0 information: State Online Temperature 36 <<<<<<<<<<<<<< Correct info Total CPU DRAM 3584 MB Total XR2 518 MB Total DDR DRAM 49920 MB Start time: 2015-05-12 12:36:14 AST Uptime: 9 days, 1 hour, 31 minutes, 38 seconds Max Power Consumption 1088 Watts Slot 9 information: State Online Temperature 37 Total CPU DRAM 2048 MB Total RLDRAM 331 MB Total DDR DRAM 1280 MB Start time: 2015-05-12 12:38:00 AST Uptime: 9 days, 1 hour, 29 minutes, 52 seconds Max Power Consumption 239 Watts Slot 10 information: State Online Temperature 42 <<<<<<<<<<<<<< Correct info Total CPU DRAM 3584 MB Total XR2 518 MB Total DDR DRAM 49920 MB Start time: 2015-05-12 12:36:18 AST Uptime: 9 days, 1 hour, 31 minutes, 34 seconds Max Power Consumption 1088 Watts PR1090671
Wrong diagnostic optics info might be seen for GE-LX10 SFP and SFP+ for SumitomoElectric. The issue only for a specific SFP type - "Xcvr vendor part number : SCP6F44-J3-ANE”, it can be seen with "show chassis pic fpc-slot X pic-slot Y". user@device> show chassis pic fpc-slot 0 pic-slot 0 .. PIC port information: Fiber Xcvr vendor Wave- Xcvr Port Cable type type Xcvr vendor part number length Firmware 0 GIGE 1000LX10 SM OPNEXT INC TRF5736AALB227 1310 nm 0.0 1 GIGE 1000LX10 SM FINISAR CORP. FTLF1318P2BTL-J1 1310 nm 0.0 2 GIGE 1000LX10 SM SumitomoElectric SCP6F44-J3-ANE 1310 nm 0.0 <<<<Error SFP>PR1091063
Occasionally, AFEB PCI reads from Cortona MIC with ATM OAM traffic might return garbage values even though the actual content in the MIC has the correct value, this corrupted values would lead to AFEB crash , and also PCI error logs such as : afeb0 PCI ERROR: 0:0:0:0 Timestamp 91614 msec. afeb0 PCI ERROR: 0:0:0:0 (0x0006) Status : 0x00004010 afeb0 PCI ERROR: 0:0:0:0 (0x001e) Secondary bus status : 0x00004000 afeb0 PCI ERROR: 0:0:0:0 (0x005e) Link status : 0x00000011 afeb0 PCI ERROR: 0:0:0:0 (0x0130) Root error status : 0x00000054 afeb0 PCI ERROR: 0:0:0:0 (0x0134) Error source ID : 0x02580258 afeb0 PCI ERROR: 0:2:11:0 Timestamp 91614 msec. afeb0 PCI ERROR: 0:2:11:0 (0x0006) Status : 0x00004010 afeb0 PCI ERROR: 0:2:11:0 (0x004a) Device status : 0x00000004 afeb0 PCI ERROR: 0:2:11:0 (0x0052) Link status : 0x00004001 afeb0 PCI ERROR: 0:2:11:0 (0x0104) Uncorrectable error status : 0x00000020 afeb0 PCI ERROR: 0:2:11:0 (0x0118) Advanced error cap & ctl : 0x000001e5 afeb0 PCI ERROR: 0:2:11:0 (0x011c) Header log 0 : 0x00000000 afeb0 PCI ERROR: 0:2:11:0 (0x0120) Header log 1 : 0x00000000 afeb0 PCI ERROR: 0:2:11:0 (0x0124) Header log 2 : 0x00000000 afeb0 PCI ERROR: 0:2:11:0 (0x0128) Header log 3 : 0x00000000 PR1097424
When the clock sync process (clksyncd) is stopped and resumed during link flaps, the clksyncd process might get into an inconsistent state with various symptoms, the clock source might be ineligible due to "Interface unit missing" or "Unsupported interface" with no Ethernet Synchronization Message Channel (ESMC) transmit interfaces. PR1098902
In abnormal session close scenario like by pulling-out running ms-mpc or in scaled flow environments, some garbage object can remain due to a bug on internal flow state machine then would trigger mspmand coredump. The fix of this PR clears such a problematic status objects. PR1100363
After Junos OS Release 13.3R1, IPCMON infra is added to debug IPCs between PFEMAN and the Routing Engine. When convergence occurs, string processing of IPCMOM will take added time. Then the slow convergence will be seen. It is a performance issue, it is visible in scaled scenario (for example, more than 100K routes). As a workaround, please execute command "set pfe ipclog filter clear" to disable IPC logging on all FPCs. PR1100851
Fragmenting a special host outbound IP packet with invalid IP header length (IP header length is greater than actual memory buffer packet header length), can trigger NULL mbuf accessing and dereferencing, which may lead to a kernel panic. PR1102044
On MX Series platforms, in subscriber management environment, when carrying scaling subscribers, as the Packet Forwarding Engine process (pfed) memory usage will grow along with the number of subscribers, the pfed memory usage limit may get reached (that is, 512M) because of the subscriber scale and number of service attached to the subscribers (for example, when carrying more than 140k single stack PPPoE subscribers per chassis, 4 services per subscriber), in this situation, the pfed crash may occur due to memory exhaustion. PR1102522
On MX Series platform, in subscriber management environment, if the subscriber's underlying logical interface (IFL) is static (for example, ge-x/y/z.0 or aex.0 rather than ge-x/y/z.32767 or aex.32767) with family inet configured, when all the subscribers are logged out, the ARP on the underlying IFL may stop resolving the next-hop path due to the incorrect deletion of ARP family of the underlying IFL when removing subscribers. PR1102681
With Nonstop active routing (NSR) enabled, deleting routing-instance/logical system configuration might cause a soft assert of rpd. If NSR is not enabled, after deleting routing-instance/logical system configuration, executing "restart routing" might trigger this issue too. The core files could be seen by executing CLI command "show system core-dumps". This timing issue has no function impact. PR1102767
cpcdd core observed in scaled scenario. PR1103675
On MX Series platform, when using DHCPv6 Prefix Delegation (DHCPv6-PD ) and DHCPv6 subscriber sessions are layered over an underlying PPPoE subscriber session, due to the fact that the value of the UDP checksum in Echo reply message might get incorrectly set to all zero (i.e. "0x0000"), a small number (for example, on a 1 to 5 subscribers out of 10000 subscribers basis) of subscribers might fail to renew the IPv6 addresses in each lease time circle.PR1103349
When using "write coredump" to invoke a live coredump on an FPC in T Series, the contents of R/SR ASIC memory (Jtree SRAM) will get dumped. In the situation that there is a parity error present in the SRAM, then the coredump will abort and the FPC will crash. As a workaround, configuring "set chassis pfe-debug flag disable-asic-sram-dump" before "write coredump" will help to avoid the issue. PR1105721
When mspmand (which manages the Multiservice PIC) core dump (when the mspmand crash, it will dump a core file for analysis) is in progress in MS-MPC/MS-MIC and a GRES command is issued at the same time, it is seen that the MS PIC gets stuck and has to be recovered by offlining/onlining the PIC. PR1105773
Dynamic vlan ifl is not removed with 'remove when-no-subscriber' configuration. PR1106776
When Bridge domain in PBB-EVPN Routing instance is modified to add/remove ISIDs BD can get stuck in destroyed state. This happens when ISIDs in the Bridge domain are changed from 1 to many or many to 1. This is only noticed during configuration changes or initial deployment. PR1107625
Under IPv6 VRRP scenario, when a host sends router solicitation messages to VRRP virtual IPv6 address, the VRRP master replies router advertisement messages with physical MAC address instead of virtual MAC, the VRRP slave replies router advertisement messages with physical MAC address as well. As a result, the host has two default gateways installed and the host will send traffic directly to two devices but not to the VRRP virtual IP. This issue affects VRRP function and traffic. PR1108366
On MX Series platform with "subscriber-management" enabled, while high scaled subscribers (for example, 126K dual-stack DHCP v4/v6 subscribers over VLAN demux) login/logout at high rate, MX Series-based line cards which hold subscribers might crash after the bbe-smgd process restart. PR1109280
On MX240/480/960 Series router with MS-DPC, customer running BGP over IPsec. This BGP session has a BFD session tied to it. The BGP session is up but the BFD session remains in INIT state. The issue might be seen with any service configured with multihop BFD enabled. Traffic forwarding will not be affected. PR1109660
In subscriber management environment and the accessing interface is an AE interface, after AE interface flap or FPC reboot, the subscriber traffic accounting might not be reported on demux interface but on the underlying AE interface. PR1110493
In rare condition, after Routing Engine switchover, the MPC PIC might offline, and some error messages might be seen. PR1110590
This issue is a regression defect introduced in Junos OS Release 11.4R11, 12.1R10, 12.2R8, 12.3R6, 13.2R4, 13.3R2, 14.1R1. After upgrading to those releases containing the original fix, when there is no export policy configured for forwarding table to select a specific LSP, whenever routes are resolved over RSVP (for example, due to aggressive auto-bandwidth), resolver will spend considerable amount of time on resolver tree, which contributes to base line increase in rpd/Routing Engine CPU. PR1110854
Resolved problem with Syslog messages generated like "krt_decode_resolve for 239.255.255.250, 101.11.67.33: no logical interface for index 1073741825" when Multicast packets are received on Subscriber interfaces. PR1110967
On MX Series platform, when using FTP Application-level gateway (ALG), if the FTP (including both active mode and passive mode) server requests client to use different IP address for control session and data session (i.e. after the control session is established, the destination IP address of FTP server is changed on which client should transfer the data), although the control session could be built, the data session could not be established due to wrong pinhole creation. The issue would not occur in the scenario that the port is changed while the destination IP address is the same. PR1111542
CLI core dump is due to repeated mismatched XML open/close directives in the "show pppoe lockout" output. This issue is most likely to occur when there is a ratio of 8 PPPoE clients in lockout per VLAN. PR1112326
Right now this fix is available from Junos OS Release 14.2R6 and later. On Junos OS Release 14.2R5 or older images MSRPC gates once opened would never gets deleted. From Junos OS Release 14.2R6 and later, MSRPC gates are opened for 60 mins no matter whether expected packet hits gate or not. After 60 minutes gates are deleted by timer. PR1112520
In the scenario that the power get removed from the MS-MPC, but Routing Engine is still online (for example, on MX960 platform with high capacity power supplies which split into two separate power zones, when the power zone for the MS-MPC line card loses power by switch off the PEM that supports the MS-MPC situated slot), if the power goes back (for example, switch on the PEM), the MS-MPC might be seen as "Unresponsive" (checked via CLI command "show chassis fpc") and not coming up back online due to failure of reading memory. PR1112716
Under certain conditions, when the Junos OS Routing Engine tries to send an IP packet over a IPIP tunnel, the lookup might end up in an infinite loop between two IPIP tunnels. This is caused by a routing loop causing the tunnel destination for Tunnel#A to be learned through Tunnel#B and the other way round. PR1112724
On all Junos OS platform, when the Junos Routing Engine tries to send an IP traffic over a GRE tunnel, the route lookup might end up in an infinite loop between two GRE tunnels (the infinite loop is caused by a routing loop causing the tunnel destination for Tunnel A to be learned through Tunnel B and the other way round), the kernel would crash as a result. As a workaround, the issue could be avoided by preventing the tunnel destination of a tunnel to be learned through a second tunnel (and the other way round). PR1113754
On MX Series Virtual Chassis with "subscriber-management" enabled, the BBE subscriber management daemon (bbe-smgd) might crash on the Virtual Chassis backup router (VC-Bm) during subscribers concurrent login/logout. The bbe-smgd process restart results in a minimal impact of system and subscribers. All connected subscribers remain connected. PR1113792
On MX Series routers with Junos OS release 12.3X54-D20 or 12.3X54-D25, Inverse multiplexing for ATM (IMA) interfaces on MIC-3D-4COC3-1COC12-CE may not come up due to "Insufficient Links FE" alarm. This is due to data corruption on the physical layer. PR1114095
On MX-VC with heartbeat connection, if it is in a scaled subscribers environment, when power down both VCM Routing Engines, there might be a delay (minutes) for backup chassis to be master and during which time, traffic blackhole might be seen. PR1115026
On MX Series Virtual Chassis with "subscriber-management" enabled, the BBE subscriber management daemon (bbe-smgd) might crash on the Virtual Chassis backup router (VC-Bm) during subscribers concurrent subscribers churn. The bbe-smgd process restart results in a minimal impact of system and subscribers. All connected subscribers remain connected. PR1115187
After VC Protocol Master Switch, new VCMm could allocate STP index of 1 (which is global discarding state) to new Physical Interfacess resulting in STP status incorrectly marked to discarding on the FPCs of the current VCBm. Please note for the fix to be effective, it is required that MXVC setup is rebooted once after upgrade of all the Routing Engines of the MXVC chassis with new fixed image following normal upgrade procedure and hence ISSU based upgrades are not supported. PR1115677
On a busy MX Series Virtual Chassis platform, for example, with 100k subscribers and 16k subscribers concurrent login/logout, the ksyncd process might crash on Virtual Chassis backup Routing Engines after a local or global graceful Routing Engine switchover (GRES). This issue has no service impact. PR1115922
For MPC6E with CFP2, there was a race condition between the Interrupt service routine and the periodic, as a result interface up/down will not happen for laser off/on. PR1115989
On MX240/MX480/MX960 platform with MS-DPC card, in some race conditions, after deactivating member interface of the aggregated multiservices (AMS) interface, the service PIC daemon (spd) might crash due to memory corruption. As a workaround, we should offline the member PICs before changing the AMS configuration and then online the PICs. PR1117218
On M Series /MX Series platform, the 10G Tunable SFP/SFP+ can not be tuned in Junos OS Release 15.1R2. PR1117242
In broadband edge (BBE) environments with graceful Routing Engine switchover (GRES) enabled, the BBE subscriber management daemon (bbe-smgd) might crash on the master Routing Engine after Routing Engine switchover. PR1117414
On MX Series routers containing multiple Packet Forwarding Engines such as MX240/MX480/MX960/MX2010/MX2020, with either MPC3E/MPC4E/MPC5E/MPC6E cards, if the routers have GRE decap, then certain packet sizes coming via these aforementioned line cards, at very high rate can cause these line cards to exhibit a lockup, and one or more of their Packet Forwarding Engines corrupt traffic towards the router fabric. PR1117665
During the LSP switch-over, the hiwatermark may get set to unexpectedly high value. The issue happens due to incorrect reference point taken while calculating the Max avg BW in the last interval and this results in incorrect Highest Watermark BW in the autobadnwidth stats. PR1118573
alg-logs and pcp-logs are not supported under [edit edit services service-set <ss name> syslog host local class] on ms interface as of now. Added warning message for the same during configuration commit. PR1118900
On MX Series platform, in rare condition, if removing or deactivating "member-interfaces" configured for an aggregated Multiservices (AMS) bundle (only officially supported on MS-MPC/MS-MIC), for example, using CLI command "deactivate interfaces ams0 load-balancing-options member-interface mams-7/1/0", all the MX Series-based FPCs and the MS-MPC/MS-MIC may crash. As a workaround, to avoid the issue, below is the recommended procedures to change AMS bundle size, 1. Offline member PICs 2. Change AMS configuration 3. Online member PICs PR1119092
The rpd process might crash when executing CLI command "show evpn database" with the combination of "vlan-id" and "mac-address". PR1119301
In the multicast environment with pd interface (interface on the rendezvous point (RP) that de-encapsulates packets), if execute GRES multiple times, and the GRES interval is less than 30 minutes, the routes on master Kernel are added and deleted for a short while. In rare condition, backup Kernel will not be able to see them. So after Routing Engine switchover, the new master Kernel will delete next-hop ID for such routes, but Packet Forwarding Engines will not see this deleted message. As a result, the Kernel/Packet Forwarding Engine are out of sync for such particular next-hop ID, it might trigger a reset of all the Packet Forwarding Engines. As a workaround, please do the Routing Engine switchover more than 30-minute intervals. PR1119836
On MS-MPC equipped MX Series platform, during the "three-way handshake" process, when receiving ACKs (e.g. after sending SYN and receiving SYN/ACK) with window size 0 (as reported, it is set to 0 by TCP client when using some proprietary protocol), the ACKs would be incorrectly dropped by the line card due to failure in TCP check. This issue could be avoided by preventing software from dropping packets that fail in the check, for example, by CLI command below, re# set interfaces ms-3/0/0 services-options ignore-errors tcp. PR1120079
The commands "show igmp interface <interface name>" and "show mld interface <interface name>" may sometimes result in memory corruption and cause a core dump of smg-service daemon. PR1120484
The commit latency will increase along with the increasing lines under [edit system services static-subscribers group <group name> interface]. Use ranges to create static demux interfaces is a recommended option. e.g.: [edit system services static-subscribers group PROFILE-STATIC_INTERFACE] + interface demux0.10001001 upto demux0.10003000; PR1121876
On MX240/MX480/MX960/MX2010/MX2020 products with MPC2E-3D-NG/MPC2E-3D-NG-Q/MPC3E-3D-NG/MPC3E-3D-NG-Q with MIC-3D-4XGE-XFP, Physical Interface flap detection is much slower. It might lead to high FRR time, some traffic might be lost. PR1122589
ovs-vxlan -- irb mac address is missing in ovs database. PR1122826
For scaled configuration, it may take too much time for commit, and session gets hung because there is an unnecessary check to see if family Ethernet-switching co-exists with family bridge for all interfaces having bridge configuration. PR1122863
MX Series router acting as L2TP access concentrator (LAC) may not recognize the MLPPP protocol field (0x003d) in the inbound PPP packet from customer premise equipment (CPE) and could disconnect the session not respecting idle-timeout. The traffic forwarding might be affected. PR1123233
When MX-VC is under a high latency transport condition (usually happens in DDoS attack), the performance might reduce and the backup Routing Engine's unnecessary and harmful resync operations could ultimately consume the entire available /mfs buffer space, which finally resulting in traffic loss. PR1123842
On MX Series platform, the MS-MPC crash may occur. The exact trigger of the issue is unknown, normally, this issue may happen over long hours (e.g. within a week) of traffic run (e.g. running HTTP/HTTPS/DNS/RTSP/TFP/FTP traffic profile). PR1124466
With BGP configured on CE-faced interfaces (in VRFs), doing 'show route' frequently may cause rpd to slowly leak memory. The leak rate will be one memory block of the size necessary to hold the instance name of the routing instance for a BGP neighbor. If the rpd process memory gets exhausted, the rpd process might crash, and the routing protocols are impacted and traffic disruption will be seen due to loss of routing information. You can check rpd memory usage with "show task memory brief" command. PR1124923
Right now this fix is available from Junos OS Release 14.2R6 and later. On Junos OS Release 14.2R5 or older images SUN RPC gates once opened would never get deleted. From Junos OS Release 14.2R6 and later, SUN RPC gates are opened for 60 minutes no matter whether expected packet hits gate or not. After 60 minutes gates are deleted by timer. PR1125690
In multihoming EVPN scenario and the customer facing interface is an AE interface, after moving an interface from the EVPN instance into a VPLS instance, traffic loss might be seen on CE facing FPC. PR1126155
In EVPN scenario, the EVPN route table between the master Routing Engine and backup Routing Engine would be different (unused garbage routes will appear) once Routing Engine switchover (e.g., by rebooting the "old" master Routing Engine or performing graceful routing engines switchover) is performed, which may cause kernel crash on the new master Routing Engine in some cases. PR1126195
When Junos OS devices use Link Layer Discovery (LLDP) Protocol, the command 'show lldp neighbors' displays the contents of PortID Type, Length, and Value (TLV) received from the peer in the field 'Port Info', and it could be the neighbor's port identifier or port description. Junos OS CLI configuration statement can select which 'interface-name' or 'SNMP ifIndex' to generate for the PortID TLV, so we do not have any problem as long as two Junos OS devices are connected for LLDP, but we might have an interoperability issue if other vender device which can map the configured 'port description' in the PortID TLV is used. In such case, Junos OS displays the neighbor's PortDescription TLV in the 'Port info' field, and if the peer sets 'port description' whose TLV length is longer than 33 byte(included), Junos is not able to accept the LLDP packets then discards packets as errors. The PortID TLV is given as : "the port id tlv length = port description field length + port id subtype(1B)". PR1126680
EVPN route attributes like the label and Ethernet segment identifier (ESI) may be missing from EVPN family routes installed by BGP. PR1126770
In multi-homing Ethernet VPN (EVPN), if there are two loopback addresses and the router-id and the primary loopback addresses are different on the designated forwarder (DF) PE, when the link between CE and DF PE down, the Type 4 route of old DF are not deleted properly from the backup PE and causing the new DF election failure. The traffic forwarding will be affected. As a workaround, we should configure single primary loopback address and remove "router-id" configuration statement on both multi-homing PEs. PR1126875
On M320/T320/T640 with FPC 1/2/3 and their enhanced version (-E2/-E), in multicast scenario and AE interface is within multicast NH (such as, AE interface is the downstream interface for a multicast flow), egress multicast statistics displays incorrectly after flapping of AE member links. PR1126956
An incorrect destination MAC address is applied to the packet when a DHCPv6 Offer/Advertise packet is sent back to the subscriber from a non-default routing instance across a pseudowire. PR1127364
On MX Series platform with "subscriber-management" enabled, when a dynamic DHCPv4 subscriber is stacked over a static VLAN and the "route-suppression access-internal" configuration statement is enabled, before the subscriber is established, it is possible for ARP process to first add a resolved route matching the subscriber's IP address. Then when the subscriber is established, the subscriber management process will change this route, but the change is not handled properly in the Packet Forwarding Engine. Due to this timing issue, the broadband network gateway (BNG) fails to forward transit packets to this subscriber. For example, the external DNS serverÂ’s response packets might not be delivered to the voice subscriber interface resulting in voice service outage. As a workaround, we can disable "route-suppression". PR1128375
On MX Series platform, when offlining the line card (possibly, with any of the line cards listed below), "Major alarm" might be seen due to HSL (link between line card and Packet Forwarding Engine) faults. This fault is non-fatal and would not cause service impact. The line cards that may hit the issue could be seen as below, MS-MPC/MS-MIC MIC-3D-8DS3-E3 MIC-3D-8CHDS3-E3-B MIC-3D-4OC3OC12-1OC48 MIC-3D-8OC3OC12-4OC48 MIC-3D-4CHOC3-2CHOC12 MIC-3D-8CHOC3-4CHOC12 MIC-3D-1OC192-XFP MIC-3D-1CHOC48. PR1128592
In current Juniper implementation, the IPv6 multicast Router Advertisement timer is not uniformly distributed value between MinRtrAdvInterval and MaxRtrAdvInterval as described in RFC 4861. PR1130329
When software encounters an error configuring the optics type into the VSC8248 PHY retimer component of an MX MIC/PIC (typically done on SFP+ module plugin), this could lead to 100% FPC CPU utilization indefinitely. MPCs and MICs that are potentially affected are: MPC3 + 10x10GE SFPP MIC MPC4 32XGE MPC4 2CGE+8XGE (10G interfaces only) MPC6 + 24x10GE (non-OTN) SFPP MIC. PR1130659
On MX with MS-MIC (or possibly, MS-MPC is affected as well), changing configuration of sampling input parameters, such as "rate" under forwarding-options is not reflected without restarting the line card. PR1131227
On MX Series based line cards, multiple modifications of firewall filter might cause lookup chip error and traffic blackhole, following jnh_free error messages could help to identify this issue: messages: fpc1 jnh_free(10212): ERROR [FW/3]:1 Paddr 0x006566a9, addr 0x2566a9, part_type 0call_stack 0x40497574 0x418ffa84 0x41900028 0x418ecf94 0x41861690. PR1131828
CLI output of "clear services sessions" gives an impression to the user that session is marked for deletion in case of delayed delete but the XML output "clear services sessions|display xml"of the above command says "session removed". Ideally both should convey the same message to the user. The changes have been made to make sure CLI and XML information given to the user in sync. PR1132006
Packet logs were not available in previous releases. Now in X55-D35 onwards and in mainline from (exact 14.2, 15.1 releases numbers to be determined), these logs will be available.. PR1132162
When customers do changes under "protocol router-advertisement interface X" (such as changing timers etc), they expect that commit would trigger an new router-advertisement being sent out to notify hosts about configuration changes. However it does not seem to be a case unfortunately. It makes the router information to expire on hosts and causes obvious loss of connectivity for the hosts. PR1132345
In subscriber management environment with autosense VLAN, if IP demux interface is not configured, the IGMP/MLD join message from client might be dropped due to "Bad Receive If". PR1132929
The subscribers login rate could be degraded when IGMP/MLD is enabled on the dynamic demux interface. PR1134558
On MX Series platforms with non-Q MPC (for example, MPC2-3D) or Q-MPC with enhanced-queueing off, when traffic has to egress on any one of the dynamic PPPoE (pp0), IP-DEMUX (demux0) and VLAN-DEMUX (demux0) Logical Interfaces, the queue mapping might get wrong. The traffic forwarding might be affected. PR1135862
While bringing down subscribers, the system generates [ Deinstantiate Service Failed permanently, daemon: cosd ] error message. PR1136083
MXVC-Same subnet VC-heartbeat polling failed to recover. PR1136119
On MX Series platforms with MIC3-3D-1X100GE-CFP, after In-Service Software Upgrade (ISSU), the Junos upgrade is successful, but the 100GE port will be down, and the traffic forwarding will be affected. PR1136269
In IGMP over subscriber environment with configuration statement "remove-when-no-subscribers" configured, after performing graceful Routing Engine switchover, subscribers with multicast joins cannot re-login when subscriber logout before it sends IGMP leave in new master. PR1136646
On MS-MIC, TCP session Up/Down causes JSERVICES_NAT_* and JSERVICES_SESSION_* messages though severity level "none" is configured for services. PR1137596
JNH periodically attempts to recover memory no longer in use. Recently when Firewall address space was expanded to 16M, a side effect was triggered -- memory recovery was extended to 16M as well. On the Hercules line card, Firewall does not use a small block of IDMEM, causing JNH to attempt the return of the unused memory. There is no mechanism for recovery of IDMEM, therefore, this message is displayed. Excepting the syslog impact, there is no further effect on the line card. PR1140021
From Junos OS Release 14.1R4, 14.2R3, 15.1 and later, when firewall filter is applied to NG-MPC, after system reboot, Routing Engine might go into amnesiac mode. PR1141101
In subscriber management environment, on MX Series platform, after login/logout static subscribers (e.g. by setting/deleting the interface), some of the static subscribers may get stuck in "Terminated" state. PR1143205
When multicast-only fast reroute (MoFRR) is enabled in PIM or multipoint LDP domain, memory leak will be observed on generation of the multicast FRR next-hops. The leak rate is 8-byte for IPv4 and 12-byte for IPv6 addresses, per FRR next-hop created. Eventually, the rpd process will run out of memory and crash when it cannot honor some request for a memory allocation. PR1144385
When ARP is trying to receive a nexthop message whose size (for example 73900 bytes) is bigger than its entire socket receive buffer (65536 bytes), the kernel might crash, and the traffic forwarding might be affected. PR1145920
On MX Series routers with "subscriber-management" enabled, the BBE subscriber management daemon (bbe-smgd) might crash on the backup Routing Engine when performing graceful Routing Engine switchover (GRES) during subscribers concurrent login/logout. PR1147498
On MX Series platform, in multicast subscriber management environment (e.g. IGMP is configured for subscribers in dynamic profile), when nonstop active routing (NSR) is enabled, if the routing protocol process (rpd) is busy or there are hundreds of multicast groups are active (e.g., 250), missing multicast entries issue might be seen after performing Routing Engine switchover twice or more (i.e., first Routing Engine switchover works fine, and the issue may occur from the second switchover and onward). As a workaround, this issue could be avoided by issuing CLI command "restart smg-service" on backup Routing Engine after every switchover. PR1149065
When a routing instance is configured with "routing-instances <instance name> routing-options localized-fib" then VPN localization may fail, causing all routes for the affected routing instance to be installed on all Packet Forwarding Engines. PR1149840
Commit error after attempting to delete all guaranteed rates on all traffic-control-profiles associated with demux0 [edit] lab@mx480-J12_09# commit re0: [edit class-of-service interfaces] 'demux0' IFL excess rate not allowed on interface (demux0), please specify guaranteed rate on at least one IFL error: configuration check-out failed. PR1150156
When using type 5 FPC on T4000 platform, traffic go out of the interface where "source-class-usage output" is configured will be dropped if the Source class usage (SCU) or Destination Class Usage (DCU) policy configuration is missing. This issue is caused by incomplete configuration so, to avoid the issue, please make the configuration complete (e.g. with "source-class-usage output" and SCU policy). PR1151503
In the TXP environment, the Line-Card Chassis (LCC) Switch Interface Board (SIB) status is not right when execute command "user@router> show chassis environment", their status are Absent, but no alarms. This is a minor issue, it does not affect business. PR1156841
A previous enhancement to strengthen the VC-Heartbeat message exchange resulted rejecting messages at the crucial time of determining the health of the other VC member when all adjacency links fail. Validation of messages has been adjusted to remain strong when the VC is connected, but relaxed during the split conditions to prevent rejecting valid messages. PR1157383
In PPPoEv6 scenario, the unsolicited Router Advertisement will be sent out before get IPCPv6 ack. This behavior will impact PPPoEv6 connection rate. We can use "no-unsolicited-ra" configuration statement to suppress this message as a workaround. But in this case, this configuration statement does not work. The unsolicited Router Advertisement will still be sent out. PR1158476
High Availability (HA) and Resiliency
On MX Series platforms with Junos OS Release 15.1R1 and later, while a core dump is in progress, if we try to access the dump directory, due to the deadlock defect, the system might hang and crash. As a workaround, we should not access the "/var/crash" directory till the core dump is complete. PR1087082
On MX Series Virtual Chassis (MX-VC) with scaled configuration, for example, 110000 DHCP and 11600 PPP subscribers, the unified in-service software upgrade (ISSU) might fail due to the management daemon (MGD) timer expiring before Field-replaceable units (FRUs) update finish. PR1121826
On MX240/480/960/2010/2020 platform with Junos OS Release 15.1R1 and and later, in high scale scenario (e.g., there are 4 million routes or more), the connection between Routing Engine and the FPC(s) may flap after performing graceful Routing Engine switchover (GRES). The other symptoms are intermittent packet drops between the Routing Engine and FPC during regular operation without performing GRES and scaled scenario. PR1146548
Infrastructure
Only the following directories and files are preserved when upgrading from build prior to 15.1 to 15.1 (FreeBSD 10) . config/ /etc/localtime /var/db/ /var/etc/master.passwd /var/etc/inetd.conf /var/etc/pam.conf /var/etc/resolv.conf /var/etc/syslog.conf /var/etc/localtime /var/etc/exports /var/etc/extensions.allow /var/preserve/ /var/tmp/baseline-config.conf /var/tmp/preinstall_boot_loader.conf Anything else not listed above is deleted/formatted during upgrading to freebsd10 version of Junos OS.PR959012
When "show version detail" CLI command has been executed, it will call a separate gstatd process with parameter "-vvX". Because the gstatd could not recognize these parameters, it will run once without any parameter then exit. In result of "show version detail", following information could be seen: user@hostA> show version detail Hostname: hostA Model: mx960 Junos: 13.3R6-S3 JUNOS Base OS boot [13.3R6-S3] JUNOS Base OS Software Suite [13.3R6-S3] .. <snipped> file: illegal option -- v usage: gstatd [-N] gstatd: illegal option -- v usage: gstatd [-N] <snipped> At the same time, log lines like following might be recorded in syslog: file: gstatd is starting. file: re-initializing gstatd mgd[14304]: UI_CHILD_START: Starting child '/usr/sbin/gstatd' gstatd: gstatd is starting. gstatd: re-initializing gstatd gstatd: Monitoring ad2 gstatd: switchover enabled gstatd: read threshold = 1000.00 gstatd: write threshold = 1000.00 gstatd: sampling interval = 1 gstatd: averaged over = 30 mx960 mgd[14304]: UI_CHILD_STATUS: Cleanup child '/usr/sbin/gstatd', PID 14363, status 0x4000 mgd[14304]: UI_CHILD_EXITED: Child exited: PID 14363, status 64, command '/usr/sbin/gstatd' PR1078702
On dual Routing Engine platforms, if GRES is configured (triggered by "on-disk-failure"), when a disk I/O failure occurs on the master Routing Engine due to hardware issue (for example, SSD failure), the graceful Routing Engine switchover might not be triggered immediately after initial IO failure has been detected. As a result, the Routing Engine might enter a state in which it responds to local pings and interfaces remain up, but no other processes are responding. PR1102978
With scaled configuration or there are memory leaks, if the virtual memory is running very low, the kernel might crash and the device will go in db prompt continuously due to a recursion issue. PR1117548
The "show route vpn-localization" command does not have any output, but if xml format requested then xml output of the same command works. PR1125280
In scaling setup (in this case, there are 1000 VLANs, 1000 Bridge Domains, 120 IRB interfaces, 120 VRRP instances, BGP and IGP), if the routing protocols are deactivated and activated, there might be a chance that the pending route stats are not cleaned up, which will cause the stats infra to have stale pointers and lead to memory corruption in socket layers. The system might go to db prompt because of this. All the traffic goes through the router will be dropped. PR1146720
Interfaces and Chassis
Reconfiguring lt- interface causes dcd memory leak.PR 879949
On MX Series routers, the physical or logical interfaces (ifd/ifl) might be created and marked UP before a resetting FPCs' fabric planes are brought up and ready to forward traffic, as a result, traffic might be black-holed during the time window. This window of traffic black-hole is particular long if the chassis is heavily populated with line-cards, for example, the router has large scale of configuration (routes or subscribers), and coupled with a lot of FPC reset, such as upon a node power up/reset. PR918324
jnxBoxDescr is reworded for MXVC to replace the platform type with a more general representation that replaces the specific member platform type with "Virtual Chassis". Old virtual chassis text example: jnxBoxDescr.0 = member0 Juniper MX240 Internet Backbone Router New virtual chassis text example: jnxBoxDescr.0 = member0 Juniper MX Virtual Chassis Internet Backbone Router NOTE: The MIB design for jnxBoxAnatomy "top-level" chassis information works properly for a standalone chassis, but doesn't fully represent virtual chassis multi-member configurations because it is capable of providing information for only one physical chassis. (The remainder of the jnxBoxAnatomy MIB "containers" properly support the inventory of a multi-member configuration.) MX virtual chassis provides another MIB, jnxVirtualChassisMemberTable, to supply the equivalent "top-level" information. PR1024660
When issuing a CFM LTR from CE, link state reply, recieved from MX Series, acting as MHF doesn't contain Reply Egress TLV if ingress and igress IFL are located on the same Physical Interface PR1044589
MS-DPC might crash when allocating chain-composite nexthop in enhanced LAG scenario. PR1058699
During subscriber login/logout the below error log might occur on the device configured with GRES/NSR. /kernel: if_process_obj_index: Zero length TLV! /kernel: if_pfe: Zero length TLV (pp0.1073751222). PR1058958
Currently the redundant logical tunnel (rlt) interface only supports limited vlan range (0..1023), it should support the extended vlan range (0..4094) as the logical tunnel does. PR1085565
Trap messages does not logged on logical interface (ifl) after deleting "no-traps" configuration statement, in spite of setting explicit "traps". PR1087913
The Enhanced LAG feature is enable in network-service enhanced-ip mode, but it is not supported in enhanced-ethernet mode. PR1087982
During scaling login/logout different types of subscribers (e.g. 17K) on LAC router, there might be some L2TP LAC subscribers stuck in terminating state and never get cleared, blocking new sessions from establishing on the same interface. PR1094470
When DHCP subscribers are terminated at specific routing-instances and the interface stack is IP demux over vlan-subinterface over AE interface, there might be a memory leak in kernel AE iffamily when subscribers login/logout. PR1097824
The adaptive load balancing counters are always zero for aggregated Ethernet (AE) bundles on MICs or MPCs of MX Series routers. PR1101257
VRRP inet6 group interface does not send Router Advertisement (RA) when the interface address and virtual address are same. run show ipv6 router-advertisement interface ge-0/2/0.430 Interface: ge-0/2/0.430 Advertisements sent: 0 Solicits received: 0 Advertisements received: 0 PR1101685
With "enhanced-ip" mode and AE interface configured, if SCU/DCU accounting is enabled, the MS-DPC might drop all traffic as regular discard. PR1103669
The 'optics' option will now display data for VCP ports: show interfaces diagnostics optics vcp-0/0/0 PR1106105
On MX240 or MX480 platform with at least two DC modules (PN: 740-027736) equipped, when shutting down one of the PEMs and then turn it on again, even the PEM is functioning, the "PEM Fan Fail" alarm might be observed on the device due to software logic bug. There is no way to clear the ALARM_REASON_PS_FAN_FAIL for I2C_ID_ENH_CALYPSO_DC_PEM once it has been raised. PR1106998
On MPC-3D-16XGE-SFPP line card, when an optics (for example, 10G-LR-SFP) is disabled and then enabled administratively, if the SFP is not temperature tolerant (non-NEBS compliant), the TX laser may not be turned on due to the fact that the chassis process (chassisd) may keep sending the "disable-non-nebs-optics" command to the optics if the current temperature of FPC reaches the threshold temperature. PR1107242
On MX Series platforms, continuous error messages might be seen on the MICs (for 10G/40G/100G MICs) from MIC3 onwards (listed as below) when physical interface (IFD) settings are pushed (e.g. booting the MPC). Based on the current observation, the issue may not have any operational impact and the MICs that may encounter this issue are listed as below, - 10G MICs: MIC3-3D-10XGE-SFPP, MIC6-10G, MIC6-10G-OTN, - 40G MICs: MIC3-3D-2X40GE-QSFPP, - 100G MICs: MIC3-3D-1X100GE-CFP, MIC3-3D-1X100GE-CXP, MIC6-100G-CXP, MIC6-100G-CFP2 PR1108769
Junos OS now checks ifl information under the ae interface and prints only if it is part of it PR1114110
The jpppd process (which is used to authenticate subscribers) might crash after restarting MPC in live network, and then some subscribers might be found stuck in INIT state. PR1114851
In PPPoE subscriber management environment, when dynamic VLAN subscriber interfaces is created based on Agent Circuit Identifier (ACI) Information, the subscribers might unable to login after reboot FPC with syslog "Dropping PADI due to no ACI IFLSET". PR1117070
When an M120/M320/MX Series router acts as the Broadband network gateway (BNG) and provide the PPPoE subscriber management service, after Routing Engine switchover, it might wrongly send out IPCP Term-Req message. It will cause PPPoE subscribers login failure. PR1117213
When using Ethernet OAM Connectivity Fault Management (CFM), the CFM process (CFMD) may crash in either of the following scenarios, - Scenario 1 When CFMD is restarted or GRES. There is no specific defined configuration which could cause this crash, but normally this would be seen with VPLS or Bridge domain with multiple Mesh-groups. The crash happens rarely in this scenario. - Scenario 2 When configuring 2 interfaces in the same bridge-domain (BD) or routing-instance, and both interfaces have maintenance association end point (MEP) configuration along with action-profile enabled. Also there is no maintenance association intermediate point (MIP) configuration on that BD or routing-instance. The crash might be seen with the above configurations and when one of the interfaces is flapped or deleted and then re-created. In addition, in this scenario, this issue may not happen always as this depends on the ordering of kernel event. PR1120387
The jpppd process might crash and restart due to a stale memory reference. The jpppd process restart results in a minimal impact of system and subscribers. All connected subscribers remain connected and only subscribers are attempting to connect at time of process restart would need to retry. PR1121326
On Junos OS platforms, an aggregate-ethernet bundle having more-than one member link can show incorrect speed which would not match to the total aggregate bandwidth of all member links. The issue would be seen when LFM is enabled on the aggregate-ethernet bundle. The issue would be triggered when one of the member link flaps. Although after the flap, the current master Routing Engine would show correct aggregate speed, the backup Routing Engine would report incorrect value. In this state, when Routing Engine mastership is switched, the new master Routing Engine (which was backup) will show incorrect value. One of the side-effect of this issue is that RSVP also reflects incorrect bandwidth availability for the affected aggregate-ethernet bundle, thus can cause under-utilization of the link with LSP having bandwidth constraints. PR1121631
Since a bug which was introduced in Junos OS Release 15.1R1, loopback sub-interfaces always have a Flag down in the output of CLI command "show interfaces". PR1123618
The connectivity fault management (CFM) log message "Adjacency up" should only be logged when the router first detects remote MEP or the peer interface goes down and up causing adjacency failure for this remote MEP. But now it is wrongly logged when any peer set/clear the Remote defect indication (RDI) bit in continuity check messages (CCMs). PR1125164
If two redundant logical tunnels (rlt) sub-interfaces are configured in a same subnet and in a same routing-instance, a sub-interface will be down (this is expected), but if the sub-interface is removed from the routing-instance later, after disable and enable the rlt interface, a sub-interface might remain in down state unless removing configuration of rlt interface and then rollback. PR1127200
With incomplete cfmd configuration, for example, only MD (maintenance-domain) configured and no MA (maintenance-association) configured, or MD and MA configured but no MEP configured, SNMP walk in CFM MD table results in infinite loop and process cfmd is spinning at around 90% CPU. PR1129652
In Dynamic PPPoE subscriber management scenario, when the system is overloaded with requests coming, the subscribers might fail to login in a race condition. PR1130546
The jpppd process might crash and restart due to a buffer overwrite. The jpppd process restart results in a minimal impact of system and subscribers. All connected subscribers remain connected and only subscribers are attempting to connect at time of process restart would need to retry. PR1132373
MX-VC specific behavior for SNMP walk of jnxOperating* containers was divergent from physical MX. Returned to vergence. PR1136414
On MX Series platforms, the "Max Power Consumption" of MPC Type 1 3D (model number: MX-MPC1-3D) would exceed the default value due to software issue. For example, the value might be shown as 368 Watts instead of 239 Watts when "max ambient temperature" is 55 degrees Celsius. PR1137925
When Micro Bidirectional Forwarding Detection (BFD) sessions are configured for link aggregation group (LAG), the device control process (DCD) acts as the client to the micro BFD session. In order to monitor the connection between client (DCD) and server(BFD), client needs to exchange keep alive hello packets with the server. To send hello packets, DCD needs to move out of IDLE phase to CONFIG_BFD phase which is the reason for below log messages: dcd.c:585 dcd_new_phase_if_idle() INFO : Current phase is IDLE, going to phase CONFIG_BFD usage.c:75 dcd_trace_times() INFO : Phase Usage for IDLE : user 0.001 s, sys 0.000 s, wall 60.019 s dcd.c:717 dcd_new_phase() INFO : New phase is CONFIG_BFD usage.c:75 dcd_trace_times() INFO : Phase Usage for CONFIG_BFD : user 0.000 s, sys 0.000 s, wall 0.000 s dcd.c:717 dcd_new_phase() INFO : New phase is IDLE There is no functionality impact, however these messages may flood the logs. As a workaround, we can filter out these messages from being written to the log file according to this KB article. PR1144093
In MX-VC or VRR platforms running releases of 15.1 built before about February 2016, the following cosmetic warning message will be print upon commit: [edit] 'chassis' warning: WARNING: MPC reboot or chassis reboot is required to use MIC aware dynamic power management feature on already plugged-in MPCs. PR1144295
The alarm "CB 0 ESW Packet Forwarding Engine Some Ports Failed " was triggered by the difference "rcb_handle_esw_port_status Some Port Lost Connection online_mask" between CB0 and CB1, But the issued mask-bit was directed to an none-existed FEB. PR1148869
When using MX Series platform as Layer 2 Tunnel Protocol (L2TP) L2TP access concentrator (LAC), if login/logout tunneled PPPoE subscribers over an extended period (e.g. login/logout 16K subscribers for 24- 48 hours), kernel crash may occur due to next-hop issue. PR1150316
The outbound PPPoE control packets sourced from PPPoE daemon, such as PADO, error PADS and PADT, are assigned to queue 0 instead of queue 3. PR1154070
Customer may see errors when doing 'show interface interface-set queue <if set>' for a pure numeric interface-set name. router> show interfaces interface-set queue 803 error: can't decode interface name `803': invalid device name. PR1154667
Internal timing for bringing FPCs online is extended for MX 2020/2010 systems to accommodate longer initialization times for fabric and FPCs. PR1164147
Layer 2 Features
In LDP Hierarchical VPLS (H-VPLS) topology (for example, the Multi-Tenant Unit switch (MTU-s) is connected to two PE devices via a primary spoke PW and backup spoke PW), when the primary spoke PW is down, an LDP address withdraw message with TLVs 0x404 and 0x405, which means "flush-all-from-me", will be sent from the PE (for example, PE1) on detection of failure of the primary spoke PW to peer PE devices participating in the full mesh to flush the MAC addresses learned in the corresponding Virtual Switch Instance (VSI). After receiving the message by a PE (for example, PE2) with "mac-flush propagate" configuration statement configured, the expectation is propagating "flush-all-from-me" to other participating PE (for example, PE3), but instead, it sends 'flush-all-but-me' message incorrectly. Because of this, the receiving PE (for example, PE3) will flush all MAC entries it learned, except the ones that were learned from LSI interface to sending PE (for example, PE2). PR1131439
In VPLS scenario with AE interfaces as core facing interfaces, when LDP mesh-group is enabled with local-switching enabled in it, the neighbors configured under the local-switching hierarchical will cause LSI (Label-Switched Interface) to be created automatically. If port flapping occurs causing MPLS interface change associated with the LSI interface, the VPLS split-horizon might not be in functionality, this will cause traffic to be looped back. As a workaround, configuring configuration statement "enhanced-ip" can avoid this issue. PR1138842
When configuring the "ecmp-alb" configuration statement to enable adaptive load balancing for equal-cost multipath (ECMP) next hops, the VPLS broadcast, unknown unicast, and multicast (BUM) traffic might be dropped on egress Packet Forwarding Engine when ingress/egress interfaces are distributed to more than one Packet Forwarding Engines. As a workaround, we can disable "ecmp-alb" to avoid this issue. PR1142869
In a VPLS scenario, when "$junos-underlying-interface-unit" is configured in "dynamic-profiles" hierarchy, which is then implemented in a routing-instance. The upgrade/commit will fail with the following error message, Parse of the dynamic profile (<dynamic_profile_name>) for the interface: $junos-interface-ifd-name and unit: $junos-underlying-interface-unit failed! PR1147990
For routers equipped with the following line cards: T4000-FPC5-3D MX-MPC3E-3D MPC4E-3D-32XGE-SFPP MPC4E-3D-2CGE-8XGE MPC5E-40G10G MPC5EQ-40G10G MPC6E MX2K-MPC6E. If the router is working as VPLS PE, due to MAC aging every 5 minutes, the VPLS unicast traffic is flooded as unknown unicast every 5 minutes. PR1148971
In subscriber management environment, when login/logout the subscribers, if the accounting feature is enabled as well as the underlying interface is configured with dynamic VLAN (DVLAN), the memory leak in "/mfs" may occur due to incorrect interaction between Packet Forwarding Engine process (pfed) and authentication process (authd). PR1112333
There is a bug in code of handling the redistribution of PPM (periodic packet management) Transmit and Adjacency entries for LACP, when the Interface entry is in pending distribution state. This issue might cause ppmd crash after graceful Routing Engine switchover. PR1116741
For Routing Engine generated packet with VLAN tag, if the outgoing interface is an LT interface, the VLAN tag will not be removed even the LT interface is configured with untagged encapsulation. PR1118540
For PVSTP/VSTP protocols, when MX/EX92xx router inter-operates with Cisco devices, due to the incompatible BPDU format (there are additional 8 Bytes after the required PVID TLV in the BPDU for Cisco device), the MX might drop these BPDUs. PR1120688
In the DHCPv4 or DHCPv6 relay environment with large scaled environment (in this case, 50-60K subscribers), and the system is under stress (many simultaneous operations). The subscribers might get stuck in RELEASE state with large negative lease time. PR1125189
In scenario that DHCP relay is used along with Virtual Extensible Local Area Network (VXLAN), if DHCP discover packet is received with the broadcast bit set via a VXLAN interface on MX platform (which is acting as DHCP relay), the OFFER back from the DHCP server will not be forwarded back to the client over the VXLAN interface. Unicast offers (that is, DHCP offer packet with unicast bit set) over VXLAN and both broadcast and unicast offers over native VLAN interfaces work fine. PR1126909
In some rare scenarios, the MVRP PDU might unable to be transmitted, which could cause memory leak in layer 2 control plane daemon (l2cpd), and finally results in the l2cpd process crash. PR1127146
Input/Output pps/bps statistics might not be zero after a member link of AE interface with distributed ppmd was down in M320/T-Series(GIMLET/STOLI based FPC). PR1132562
The "Node ID" information is not shown on MX platform when traceoption flag "pdu" is configured to trace Ethernet ring protection switching (ERPS) PDU reception and transmission. PR1157219
DHCP relay with forward-only cross-VRF results in bad packet format of the DHCP DISCOVER packet. Wireshark decode of packets from MX Series to DHCP server indicate Error; End options missing. PR1157800
MPLS
With egress protection configured for Layer 3 VPN services to protect the services from egress PE node failure in a scenario where the CE site is multihomed with more than one PE router, when the egress-protection is un-configured, the egress-protection route cleanup is not handled properly and still point to the indirect composite nexthop in kernel, but the composite nexthop can be deleted in rpd even the egress protection route is pointing to the composite nexthop. This is resulting in composite nexthop "File exists" error when the egress protection is re-enabled and reuse the composite nexthop (new CNH addition fails as old CNH is still referenced in kernel). PR954154
In MPLS scenarios, removing the "family mpls" configuration from an outgoing interface may cause inet and/or inet6 nexthops associated with that interface to unexpectedly transit to dead state. Even adding back "family mpls" cannot restore it. PR1067915
Junk characters are being displayed in output of show connections extensive command. PR1081678
When an LSP is link-protected and has no-local-reversion configured, if the primary link (link1) is down and LSP on bypass (link2), then another link (link3) is brought up, before the LSP switch to link3, if link1 is enabled and link3 is disabled, the LSP will stuck in bypass LSP forever. This is a timing issue. PR1091774
If LDP is enabled via the 'protocols ldp' configuration option on a device running Junos OS, receipt of a spoofed, crafted LDP packet may cause the RPD routing process to crash and restart. PR1096835
From Junos OS Release 13.2R1 and later, in MPLS L3VPN scenario, when the "l3vpn-composite-nexthop" configuration statement is enabled on a PE router and an interface style service set is attached to the ingress interface, the L3VPN packets with the MPLS labels will be sent to the service card and dropped. As a workaround, we should disable "l3vpn-composite-nexthop". PR1109948
If "optimize-timer" is configured under P2MP branch LSP, this branch LSP will not be re-established if link flap on egress node. If "optimize-timer" is configured at protocols/mpls level, issue could be avoided. PR1113634
For advertising IPV6 packets over MPLS GRE tunnel, the IPv6 address gets stuck in KRT queue. PR1113967
For an MPLS L3VPN using LDP-signaled LSPs, in a rare racing condition (e.g. large-scale environment or Routing Engine CPU utilization is high), the rpd process might crash after an LDP neighbor down. PR1115004
If an RSVP LSP has both primaryand secondary standby path and link-protection enabled, a /32 bypass route is unhidden when the primary link goes down. This /32 route is supposed to be made hidden again when primary link comes back up. But in some cases, due to software defect, this /32 bypass route remains unhidden forever which causes some issues, for example, BFD session down due to better prefix received from Bypass LSP. PR1115895
When multipoint LDP (M-LDP) in-band signaling is enabled to carry multicast traffic across an existing IP/MPLS backbone and routing process is enabled to use 64-bit mode, the rpd might crash due to accessing uninitialized local variables. PR1118459
When OSPF LFA is enabled and there is available backup path, after clearing the LDP session to the primary path or backup path, in a very rare condition, the LDP session on this router might flap multiple times. PR1119700
When local bandwidth accounting for inactive/adaptive standby path figures that there is not enough bandwidth (due to double-counting BW on common link shared by primary and secondary path) to fit it in an already full link and brings it down, CSPF will not be retried on the path unless there is some change in TE database. PR1129602
When an PLR is a non-Juniper router, Juniper ingress node might stay on the bypass tunnel and ignore the CSPF result. PR1138252
When a link fails on an RSVP LSP which has link-protection or node-link-protection configured, the PLR (point of local repair) will initiate a bypass LSP and the RSVP LSP will be tunneled on this bypass LSP. However, if now the bypass LSP is brought down because there is a link failure on it, the PLR might only send out session_preemted PathErr message to the upstream node without sending ResvTear message. Hence the ingress node does not receive ResvTear message and the RSVP LSP is not immediately torn down. The RSVP LSP will remain UP for more than 2 minutes until the RSB (Resv sate block) on the ingress's downstream node gets time out and it sends ResvTear message to the ingress. PR1140177
There is no entropy label for LDP route in scenario of LDP tunneling across a single hop RSVP LSP with label 0 (explicit-null) used. As workaround, either remove LDP tunneling or RSVP explicit-null will resolve the issue. PR1142357
This issue is related to inter-op between multi vendor scenario. This fix will add sub-object RRO which will help change of label during FRR active scenario. PR1145627
With NSR enabled and LDP configured, the rpd process may crash and restart on the new master Routing Engine after a Routing Engine switchover. PR1155002
Network Management and Monitoring
On Junos OS Release 13.1X42/14.1X51/15.1R1/15.1R2, the SNMP average response time in the output of "show snmp statistics extensive" is wrongly calculated and might be observed with negative value. PR1112521
While the router is rebooting and SNMP polling is not stopped, SNMP requests might land on mib2d process before Routing Engine protocol mastership is resolved, causing the mib2d process crash. PR1114001
The SNMPv3 message header has a 4-byte msgID filed, which should be in (0....2147483647), when the snmpd process has been running for a long time, the msgID might cross the RFC defined range and causing Net-SNMP errors, "Received bad msgID". PR1123832
From Junos OS Release 14.1R1, SNMP informs are not sent out to the network management system (NMS) when significant events occur on a Junos device. As a workaround, we can configure an dummy trap-group. PR1127734
A merge conflict was incorrectly resolved by changing snmp trap value of jnxDomLaneNotifications to 26. The correct value will always be 25. PR1145144
With Junos OS Release 13.3R8/14.1R6/14.1X53-D30/14.2R5/15.1R2/15.1X49-D30 and later, when we configure fxp0 "master-only" address as source address of snmp trap, the snmp trap packets are not sent out after Routing Engine switchover. To restore this issue, we can use "restart snmp" or "delete/set snmp trap-options". As a workaround, we can use other addresses for snmp trap source. PR1153722
Platform and Infrastructure
In the Network Time Protocol (NTP) configuration, if the specified source ip address is not in current routing-instance, the router will use primary address of interface (which will be used to send packet) as source address, Client routers will treat the NTP packets as incorrect packets, and then NTP synchronization failed. PR872609
On MX Series based line cards, when GRE keepalive packets are received on a Packet Forwarding Engine that is different from the tunnel interface hosted, the keepalive message will apply the firewall filter configured on default instance loopback interface. PR934654
Bad udp checksum for incoming DHCPv6 packets as shown in monitor traffic interface output. The UDP packet processing is normal, this is a monitor traffic issue as system decodes checksum=0000. PR948058
When using MX2020 platform in Virtual Chassis (VC) environment, if the Virtual Chassis port (VCP) is located on the local Packet Forwarding Engine whose number is greater than 63 (i.e. VCP is located on local slot 16 or higher), the multicast traffic that should be sent to VCP will be dropped internally due to software issue. As a workaround, please configure the VCP ports on local chassis (local MX2020) slot 0 to 15, not 16 or higher. PR1008676
When the 'enhanced-hash-key services-loadbalancing' feature is used by MX Series based line cards, load balancing of flows across multiple service PICs via the source-address across does not work when internal BGP (IBGP) is used to steer traffic to the inside service-interface. For example the operator will see on the stateful firewall that the same source-address has flows across multiple service interfaces. PR1034770
When one of the "deny-commands" is incorrectly defined in the profile of TACACS+ server, all "deny-commands" regexes will be ignored, which leads to an over-permissive profile without any warning. PR1078238
If with both MPC/MSDPC and other type of DPCs equipped, for local switching at mesh group level, split horizon on PW interfaces will not work and this would cause packets to loop back to same PW interface. PR1084130
The MIB counter or "show pfe statistics traffic" shows junk PPS and invalid total traffic output counter. PR1084515
In 64-bit Junos OS environment, the Representational State Transfer (REST) API fails to start when configured with "set system services rest ...". PR1097266
On MX Series platform, if ingress "multicast-replication" is configured, the throughput of the multicast may get reduced due to unnecessary threads during Packet Forwarding Engine operation. In addition, only the performance of multicast traffic may get influenced (some of the multicast packets may get dropped on the Packet Forwarding Engine) by the issue. This PR has fixed/enhanced the performance. Now the performance limit should only be capped by fabric bandwidth in ingress Packet Forwarding Engine. In addition, before this fix, there was a limitation that VPLS/Bridging cannot run with ingress-replication feature as its BUM traffic cannot be handled by ingress-replication feature . This PR removed that limitation as well. Now BUM traffic for VPLS/Bridging is following normal multicast replication path even with ingress-replication feature. PR1098489
With ECMP-FRR enabled, after rebooting the FPC which hoisting some ECMP links, the ECMP-FRR might not work. Clear any of BGP sessions (that is the part of ECMP) could help to clear this issue. PR1101051
The kernel next-hop acknowledgement timeout maximum interval configured (krt-nexthop-ack-timeout) under the CLI hierarchy "routing-options forwarding-table" has been increase to 400 seconds to avoid performance issues with scaled subscribers. PR1102346
On an MPC3E or MPC4E or on an EX9200-2C-8XS line card, when the flow-detection feature is enabled under the [edit system ddos-protection] hierarchy, if suspicious control flows are received, two issues might occur on the device: ? The suspicious control flow might not be detected on the MPC or line card. ? After suspicious control flows are detected, they might never time out, even if traffic flows no longer violate control parameters. PR1102997
The following fields have been added to v10 Sampling (IPFIX) template and data packets: - SAMPLING RATE - SAMPLING INACTIVE TIMEOUT - SAMPLING ACTIVE TIMEOUT - TOTAL PACKETS EXPORTED - TOTAL FLOWS EXPORTED. PR1103251
Improved VTY commands to show internal JNH memory usage. PR1103660
On MX Series Virtual Chassis (MX-VC) with "locality-bias" configured, when equal-cost multipath (ECMP) load-balancing is occurring in the VC system, multicast streams and flooded Layer 2 streams may be duplicated or lost. As a workaround, we can disable "locality-bias" if possible. PR1104096
Junos defines SNMP ifXTable (ifJnxInErrors/ifJnxInL3Incompletes) counter as 64-bit width, but it worked as 32-bit width counter. It works as 64-bit width counter after the fix. PR1105266
Any configuration or logical interface (IFL) change will introduce 160 bits (20 bytes) memory leak on MPC heap memory when we have any type of inline sampling configured (ipfix or version 9). Only trigger of issue is the configuration of inline sampling, even without traffic being sampled. The leak is more evident in a subscriber management scenario when we have many IFL addition/deletion. Rebooting MPC in a controlled maintenance window is the only way to restore memory. PR1105644
On MX Series-based platforms, in MX Series Virtual Chassis (MXVC) environment, if the subscriber logical interface (IFL) index 65793 is created (for example, when carrying 15K DHCPv4 subscribers to exceed IFL index creation 65793) and the IEEE 802.1p rewrite rule is configured (for example, using CoS rewrite rules for host outbound traffic), due to usage of incorrect IFL index, the Virtual Chassis Control Protocol Daemon (vccpd) packets (for example, Hello packets) transmission may get lost on all VC interfaces, which may lead to VC decouple (split brain state, where the cluster breaks into separate parts). As a workaround, either delete the rewrite rule (delete class-of-service host-outbound-traffic ieee-802.1 rewrite-rules), or find the IFL in jnh packet trace that is not completing the vccpd send to other chassis and at Routing Engine clear that subscriber interface may resolve the issue. PR1105929
When a common scheduler is shared by multiple scheduler maps which applies to different VLANs of an Aggregated Ethernet (AE) interface, if the configuration statement "member-link-scheduler" is configured at "scale", for some VLANs, the scheduler parameters are wrongly scaled among AE member links. As a workaround, we should explicitly configure different schedulers under the scheduler maps. PR1107013
CVE-2015-5477 A vulnerability in ISC BIND's handling of queries for TKEY records may allow remote attackers to terminate the daemon process on an assertion failure. See this KB article. PR1108761
DHCP End options (option 255) is missing by DHCP-relay agent (where 20 bytes DHCP options 82 inserted) for client DHCP discover message with 19 bytes padding. PR1110939
On MX-VC, when traffic with TPID 0x88a8 or 0x9100 is sending over AE interface, the packets which across VCP links might be dropped on egress VCP Packet Forwarding Engine due to invalid fabric token. PR1112752
When inline BFD sessions and inline jflow are configured on the same Packet Forwarding Engine, with the increasing of active flows (about 65k), the BFD session might flap constantly and randomly due to the outgoing BFD packets are dropped. PR1116886
Inline 6rd and 6to4 support for XL and XL-XM based platforms. PR1116924
On MX Series-based FPC, when MPLS-labled fragmented IPv6 packets arriving at PE router (usually seen in 6PE and 6VPE scenario), the Packet Forwarding Engine might mistakenly detect such IPv6 header and then drop these packets as "L3 incompletes" in the output of show interface extensive. PR1117064
When inline static NAT translation is used, if two rules defined in two service sets are pointing to the same source-prefix or destination-prefix, changing the prefix of one of the rule and then rolling back the changes is not changing back all the pools correctly. PR1117197
On MX Series-based line cards, the firewall filter may have some issues when matching on Authentication Header (AH) protocol. This can affect VRRP (among others) when authentication is used, and an Routing Engine firewall filter is matching on protocol AH. As a workaround, we can change the filter to match on other criteria (e.g. source or destination address). PR1118824
Tnetd is a daemon used for internal communication between different components like Routing Engine and Packet Forwarding Engines. It is used mainly to initialize the right server for rsh, rcp, rlogin, tftp, or bootp clients. It might crash occasionally due to the tnetd process not handling signals properly. PR1119168
After changing an outer vlan-tags, the ifl is getting programmed with incorrect stp state (discarding), so the traffic is getting dropped. PR1121564
With "fast-synchronize" configured, adding a new configuration-group that has configuration relevant to the rpd process and apply it and commit, then any configuration commits might cause the rpd process on the backup Routing Engine crash. We can reboot the backup Routing Engine to restore. PR1122057
MX2020 or MX2010 running Junos OS software version 15.1 may experience "Minor" alarm associated with "i2c accelerator" timedout messages. PR1122821
On MX Series-based platforms, when fragmented packets go through the inline NAT (including source NAT, destination NAT, and twice NAT), the TCP/UDP checksum would not be correctly updated. In this situation, checksum error would occur on the remote end (inside and outside device). Non-fragmented packets would not be affected by the issue. If possible, this issue could be avoided by either of the following workarounds, * Enable "ignore-TCP/UDP-Checksum errors" at the inside or outside device which processes TCP/UDP data OR * Make sure there will not be any fragments subjected to inline NAT functionality by appropriate MTU adjustment or setting. PR1128671
Parity error at ucode location which has instruction init_xtxn_fields_drop_or_clip will lead to a LU Wedge. LU is lookup ASIC inside the MX Series. The LU wedge will cause the fabric self ping to fail which will lead to a FPC reset. This is a transient HW fault, which will be repaired after the FPC reset. There is no RMA needed unless the same location continues to fail multiple times. PR1129500
On Junos OS devices with DHCP Relay config but without accounting config, and the accounting license does not exist, when the first DHCP control traffic is received, the following subscriber-accounting license grace period alarms might be triggered: alarmd[1650]: Alarm set: License color=YELLOW, class=CHASSIS, reason=License grace period for feature subscriber-accounting(30) is about to expire craftd[1592]: Minor alarm set, License grace period for feature subscriber-accounting(30) is about to expire. PR1129552
For IPv6 packet with "no next header" in Hop-By-Hop header, if the Hop-By-Hop header length field value is large than 112, the router will drop such packet and log the following error: PPE PPE HW Fault Trap: Count 105, PC 60ce, 0x60ce: ipv6_input_finished_parsing LUCHIP(3) PPE_10 Errors lmem addr error. PR1130735
NTP.org published a security advisory for thirteen vulnerabilities in NTP software on Oct 21st, 2015. These vulnerabilities may allow remote unauthenticated attackers to cause Denial(s) of Service(s), disruption of service(s) by modification of time stamps being issued by the NTP server from malicious NTP crafted packets, including maliciously crafted NTP authentication packets and disclosure of information. This can impact DNS services, as well as certificate chains, such as those used in SSL/https communications and allow attackers to maliciously inject invalid certificates as valid which clients would accept as valid. Refer to JSA10711 for more information. PR1132181
Doing a file copy from a Routing-Engine running legacy Junos OS image to a Routing-Engine running Occam based Junos OS image fails. PR1132682
Too many duplicate ACK messages are generated from Packet Forwarding Engine for TCP control connection with Routing Engine. This could cause: 1. MX-VC DDoS protection violation for VC-control low queue and makds MXVC split. 2. Cause Routing Engine and FPC high CPU utilization. PR1133293
With scaled firewall filters attached to interfaces (e.g., 10k+ filters), running the "show configuration" command can cause high CPU of the mgd process. As a workaround, we can use the "show configuration |display set" command to view the configuration. PR1134117
On XM chip based line cards (e.g. MPC3/4/5/6, and FPC type 5), in rare situation, when LU or XL chip congestion occurs (e.g. may occur when configuring with more than 4000 entries in the multicast list and large traffic performing replication, please note this is not a realistic configuration), XM chip wedge may occur. PR1136973
On MX2020, when we remove whole power of a power zone, and then put the power back to the zone, FANTray LED stays Amber and FANTray LED on craft card stays OFF, and do not revert to green (FANTray LED) or ON (Craft LED) until we reboot the entire chassis system or hot swap that FAN tray. For Zone 0(PSM 0 to 8), FAN 1 shows the above described behavior. For Zone 1(PSM 9 to 17), FAN 3 shows the above described behavior. PR1138209
On MX Series platforms with MX Series base line card, si interface is configured (i.e., set chassis fpc 1 pic 2 inline-services bandwidth 1g) and service is configured on the si interface. If Physical Interface is deleted while service is still configured, the FPC might crash. PR1139348
On ungraceful exit of telnet (quit/shell logout), perm and env files created by pam were not deleted. PR1142436
When the CLI command "show pfe statistics exceptions | match reject" is executed CPROD thread in the Packet Forwarding Engine may hog the CPU and result in FPC crash. PR1142823
In certain affected Junos OS releases, executing "nhinfo -d" shell command might trigger a kernel panic. This is caused by insufficient buffer space in the routing socket requested by the "nhinfo" utility. PR1148220
On MX2010 and MX2020 platforms, when error that causes adapter card (ADC)/Switch Fabric Board (SFB) initialization failure occurs (e.g., when Switch Processor Mezzanine Board (SPMB) is bringing up the ADC/SFB which has hardware issue), the SPMB crash occurs. PR1149910
When the NTP server address is configured in Routing Instance table and reachable from inet.0 by static configuration (for example, by configuring static/route/next-table/VRF.inet.0), and NTP source-address is configured, the ntpd (the Network Time Protocol daemon running on NTP client) might pick the wrong source-address instead the configured source-address. As a result, NTP server cannot reply the NTP packet back. PR1150005
Two interrupts are received from the FPGA on the control board of the MX2010/MX2020 platforms for every i2c transaction triggered from software. Only one is expected. PR1151674
On MX Series routers with Junos OS Release 14.2R5-S1, when we specify a multiservice (ms-) interface to add a timestamp to Real-time Performance Monitor (RPM) probe messages, it will cause the mspmand process crash and the MS-MPC/MS-MIC keep crashing. As a workaround, we should configure RPM to perform timestamping either on the Routing Engine (Routing Engine based RPM) or on an installed MPC Packet Forwarding Engine (Inline-RPM). PR1152785
Fixed an issue with Inline Jflow where the Observation Domain field in exported IPFIX datagrams were always using the value attributed for LU0 in MPCs with multiple LUs per forwarding-engine. PR1152854
Fixed an issue on where MX Series cards could crash while programming a firewall filter containing flexible-match-mask. PR1157759
Routing Protocols
On large-scale BGP RIB, advertised-prefixes counter might show the wrong value due to a timing issue. PR1084125
With this change the default label hold timer was increased for 10 seconds to 60 seconds. PR1093638
When a BGP session supports multiple address families, the inactive route of some of the address families might not be flushed correctly, leading to wrong behaviors for some of the features which need to advertise inactive routes(e.g., advertise-inactive, advertise-external, optimal-route-reflection, etc). PR1097297
Due to software bug, Junos OS cannot purge so called doppelganger LSP, if such LSP is received over newly formed adjacency shortly after receiving CSNP from the same neighbor. PR1100756
IGMPv2 working in v2/v1 compatibility mode does not ignore v2 Leave messages received on a bridge-domain's L2 member interface. Moreover, an IGMP snooping membership entry for the respective group at this L2 member interface will be timed out immediately upon IGMPv2 Leave reception, even when there are some other active IGMP hosts attached to this L2 member interface. It might breaks multicast forwarding for this L2 member interface. PR1112354
When two (or more) route target communities of MP-BGP route match to two (or more) route target communities in VRF import policy of a RI duplicate routing entries might be installed in the RI. In the output of 'show route table <RI name>.inet.0 detail' two identical routing entries appear with one being marked as 'Inactive reason: Not Best in its group - No difference'. When such duplicate routing information is to be deleted, rpd process process will crash. PR1113319
When the Multicast Source Discovery Protocol (MSDP) is used, if the RP itself is the First-Hop Router (FHR) (i.e., source is local), the MSDP source active (SA) messages are not getting advertised by the RP to MSDP peers after reverse-path forwarding (RPF) change (e.g., the RPF interface is changed). PR1115494
When a logical unit of an interface is associated with a Bidirectional Forwarding Detection (BFD) session, if changing the unit number of the interface (for example, change the unit number for a running BFD session from ge-1/0/0.2071 to ge-1/0/0.285), the device may fail to change the name due to the missing check for logical interface (IFL) index change. PR1118002
On dual Routing Engine platform with nonstop active routing (NSR) and authentication of the Bidirectional Forwarding Detection (BFD) session enabled, BFD process (bfdd) memory leak may occur on the master Routing Engine and the process may crash periodically once it hits the memory limit (RLIMIT_DATA). The problem does not depend on the scale, but the leak will speed up with more BFD sessions (for instance 50 sessions). As a workaround, if possible, disabling BFD authentication will stop the leak. PR1127367
When protocol MSDP is configured and then deleted, the NSR sync status for MSDP might stuck in "NotStarted", and ISSU might fail on master Routing Engine with reason "CHASSISD_ISSU_ERROR: Daemon ISSU Abort -1(NSR sync not complete: MSDP)". PR1129003
In multicast environment with Protocol Independent Multicast sparse mode (PIM SM) used, if a upstream router of last-hop router receives the (S,G) SPT join while the shortest-path tree (SPT) is not yet established (only because multicast source is not reachable, a reachable route for SPT which is just not established yet will not cause this issue), when the multicast route get deleted on the router (e.g., receives the (S,G) prune from downstream PIM router), the router would incorrectly stop forwarding the multicast traffic even if rendezvous-point tree (RPT) path exists. PR1130279
On dual Routing Engine platforms, due to software issue, OSPF (including both OSPFv2 and OSPFv3) "DoNotAge" bit (e.g. source of LSA has flood-reduction feature enabled) is not mirrored to backup routing protocol process (rpd). In this situation, after performing nonstop active routing (NSR) switchover, the LSA on new master rpd remains without "DoNotAge" bit set. Once the LSA reaches OSPF max age, the router will flood LSA purge hence route flapping might be seen on all routers under the OSPF topology. PR1131075
In rare condition, mt tunnel interface flap cause backup Routing Engine core. The exact root cause is not known. While processing updates on the backup Routing Engine (received from master Routing Engine), accessing free pointer cause the core. PR1135701
On dual Routing Engine (Routing Engine) platforms with Bidirectional Forwarding Detection (BFD) protocol enabled, after graceful Routing Engine switchover (GRES), the periodic packet management process (ppmd) might crash on backup Routing Engine due to a software defect. PR1138582
RPD generates core files while processing PIM hellos. There is no known workaround for this problem. RPD core seems to happen sometimes when a *g and sg's vanishes mostly due to LHR becoming a Non-DR from a DR. PR1140230
With NSR configured, when the BFD sessions are replicated on backup Routing Engine, the master won't send the source address, instead backup Routing Engine will query the kernel to get the source address. In rare cases, the query might fail, resulting in the source address as all zeros. Later, if a GRES switchover happens, new master will have this all zeros source address. When BFD packet with this source address is send out, the other end will drop the BFD session due to no matching session (source address). PR1145612
In the BGP labeled unicast environment, the secondary route is configured with both add-path and advertise-external. If the best route and secondary route are changed in a routing table at the same time, add-path might miss to readvertise the changed route. The old route with the old label is still the last route advertised to one router instead of updating the advertisement with the new route and new label. So the traffic forwarding might be affected. PR1147126
This core is seen because of incorrect accounting of refcount associated with the memory block which composes the nhid (IRB nh). When the refcount prematurely reaches to 0 we released the memory block while it was still referenced from a route. We may see this issue when mcsnoopd becomes a slow consumer of rtsock events generated by rpd (nexthop events in the current case) and messages get delivered in a out-of-order sequence causing the refcount to be incorrectly decremented. In the testbed where the issue was reported, tracing was enabled for mcsnoopd (for logging all events) causing it to become a slow consumer. However, it may become slow also for other reasons such as processing very high rate of IGMP snooping reports/leaves which could potentially trigger this to issue. PR1153932
Core seen when BMP station was passive, and the BMP Collector was terminated non-gracefully, and BMP station was not properly cleaned up. PR1154017
Routing Policy and Firewall Filters
When a malformed prefix is used to test policy (command "test policy <policy name> <prefix>"), and the malformed prefix has a dot symbol in the mask filed (e.g., x.x.x.x/.24), the rpd process might crash. PR1144161
From Junos OS Release 13.2R1, an attempt to commit a configuration with a dangling conditional policy referring a non-existent/inactive routing-instance will be permitted. If we have a conditional policy referring an active routing-instance, deleting/deactivating this routing-instance and then committing will cause the rpd process crash. As a workaround, we should always make sure that conditional policies are referring active routing-instances. PR1144766
Services Applications
The LCP state for tunneled subscriber is incorrectly displayed as "OPENED" (which reflects the LCP state before tunneling) by CLI command "show interfaces pp0.<unit>" on the LAC. This issue will be fixed from 15.1R3. As a workaround, we can use "show ppp interface pp0.<unit>" command to determine the correct LCP state for the subscriber. PR888478
When polling to jnxNatSrcNumPortInuse via SNMP MIB get, it might not be displayed correctly. PR1100696
Junos OS Release 13.3 and later releases, when configuring a /31 subnet address under a nat pool, the adaptive services daemon (SPD) will continuously crash. PR1103237
SIP one way audio calls when using X-Lite SIP Softphone, in case that SIP media is switched to another media gateway though a SIP RE-Invite message. PR1112307
In CGNAT environment, when a service PIC is in heavy load continuously, there might be a threads yielding loop in CPUs, which will cause the CPU utilization high, and might cause one the CPUs to be reset. PR1115277
In CGNAT scenario, when we establish simultaneous TCP connects, we need to install timers for each TCP connection/flow. Due to this bug, we ended up creating two timers for the forward and reverse flow separately. Ideally there needs to be only one timer for both the forward and reverse flow. Whenever the session used to get deleted due to timer expiry, the PIC used to crash whenever the code tried to delete the same flow again. PR1116800
The Point-to-Point Tunneling Protocol (PPTP) ALG is used for tunneling Point-to-Point Protocol (PPP) packets over an IP network. But if the router configures session-limit-per-prefix, the PPTP-ALG does not work. PR1128484
In L2TP environment, the max pass-through (or transit) sessions is 8192, due to there will be a delay to remove the session when receiving PADT messages from client, if there are mess logout during a short time, the limit might be reached and the jl2tpd will crash. This issue will affect the L2TP subscribers who is trying to login, the existing subscribers will not be affected. PR1132285
With the following steps: 1) Define a RADIUS access profile with RADIUS which non-reachable from router 2) Run test access profile command 3) Abort using ctrl-c the l2tpd process will crash in few seconds. The existing active destinations, tunnels and sessions data will be recovered after the l2tpd restart. PR1155345
Software Installation and Upgrade
In certain conditions, when /var is not mounted from a persistent filesystem, executing a Junos OS upgrade will have unexpected results. This is caused by an inexact check of whether it is running from an Emergency VAR. PR1112334
Subscriber Management and Services
When the MX Series router acting as the Policy and Charging Enforcement Function (PCEF) uses Gx-Plus to request service provisioning from the Policy Control and Charging Rules Function (PCRF), the authentication service process (authd) might crash during the subscribers logout. PR1034287
In a subscriber management environment, after scaling subscribers login/logout multiple times, the MX Series routers may hang the subscriber in the terminated state and be stuck in the backup accounting queue. The reason is that, when the authentication daemon (authd) is trying to fetch data from the session database (SDB), an error (for example, session not found, or an SDB deadlock or during the SDB recovery period) may occur, and this error may cause the router to fail to notify the client daemon to clean up the service records. In this case, the subscribers may not able to send Acct-Stop messages to the RADIUS server and end up staying in a terminated state. PR1041070
This issue was introduced as part of another fix. Please contact JTAC for the recommended release for your deployment. PR1049955
In the PPP environment, when a subscriber is logged out, its IFL index is freed, but in rare conditions the session database (sdb) entry is not freed. When the IFL index is assigned to a new IFL, it is still mapped to an old sdb entry, so the jpppd process might crash because of mismatching. The issue is not really fixed, developer just adds some debug information. PR1057610
When using Neighbor Discovery Router Advertisement (NDRA) and DHCPv6 prefix delegation over PPPoE in the subscriber access network, if a local pool is used to allocate the NDRA prefix, when the CPE send DHCPv6 solicit message with both Internet Assigned Numbers Authority (IANA) and Identity Association Prefix Delegation (IAPD) options, the subscriber might get IPv6 prefix from the NDRA pool but not the delegated pool. As a workaround, the CPE should send DHCPv6 solicit message with only IAPD option. PR1063889
On MX Series platforms, in subscriber management environment, when receiving Activate-Service Vendor Specific Attributes (VSA) or Deactivate-Service VSA (for example, included in CoA-Request) from RADIUS server, the strings are parsed and empty "()" are stripped off service names, also, any white spaces are removed. Due to this reason, the service accounting message (e.g. "Accounting-Request") sent by the router (to the RADIUS server) does not contain empty "()" even if the strings were received in this way. As a workaround, changing RADIUS server to accept the service accounting message string without the "()" or the white space if possible. PR1066709
In subscriber management environment with Remote Authentication Dial In User Service (RADIUS) server configured, when performing scaling subscribers login/logout, the device may stuck in RADIUS communication. PR1070468
In subscriber management environment, the PPP daemon (jpppd) might crash repeatedly due to a memory double-free issue. PR1079511
Activating and Deactivating services in same CoA-Req packet might fail to be executed on BNG router. Please note this issue will not be seen if there is no SRL service activated/deactivated request in this CoA. PR1088366
In subscriber management environment with three or more radius-servers connected to an MX Series router, when AAA sends a request to one radius-server, if that particular request and all retries timeout, AAA records the time. For next request, AAA incorrectly uses the recorded time and marks that radius-server down even before trying to send out the request. PR1091157
Radius backup accounting queue is used to store radius records while the radius server is not alive. Draining this queue when the server is reachable again should not log any critical message as this is normal operation. PR1097491
On MX Series platforms, when using RADIUS dynamic requests for subscriber access management, if the device detects that the CoA-Request it received is same with the one in processing progress, the router would send CoA-NAK packet back to the RADIUS server with incorrect code 122 (invalid request) incorrectly. In this case, the router should return VSA with value "100 In Progess". PR1100550
FFP is a generic process that will be called during commit process, and FFP calls the PDB initialization as part of its process. On the PDB-unsupported platforms (MX Series, EX9200, M10i, M120, M320 is PDB-supported), when committing configuration, some error messages will be seen. PR1103035
In subscriber management environment, on MX Series platforms, if the configuration statement "last-statistics-when-unavailable" is configured, after the unrecoverable error, libstats is expected to not sending stats anymore, however, it is not the case here, the device may still send service interim-accounting message in wrong time-intervals to the RADIUS server. PR1105954
On MX Series platforms, when using the DHCPv6 prefix delegation over PPPoE, if the RADIUS allocates a DHCPv6 pool name during the authentication of subscribers and "on-demand-ip-address" feature is enabled in a dynamic-profile, the prefixes may not be cleared by authentication process (authd) after disconnecting the subscribers. PR1108038
When PPPoE sessions with Extensible Subscriber Services Management Daemon (essmd) subscribers configured, after terminating some PPPoE sessions without essmd service and executing a routing-engine switch, some PPPoE sessions cannot be set up. After terminating all sessions, some sessions are stuck in Terminating. The logout is queued because a Change of Authorization (CoA) is in progress and never complete. PR1111062
On MX Series platforms, in subscriber management environment, if the sequence of event happens as following: the authentication process (authd) sends dynamic-profile service acct-start request to the Radius server (this is the service activated at login), then the CoA (for example, is used to activate the ESSM service) arrives at authd before the acct-start response, so the authd starts processing the CoA before processing the acct-start response, then during the processing of the acct-start response, the CoA, now in process, is deleted leaving authd with no way to answer the CoA request. As a result, the Radius server times-out and eventually sends a Disconnect request to authd, authd will deactivate any active services and deletes all of the subscriber's service entries (since the ESSMD services are not in the 'Active' state, so they are only deleted), at this point, the business 'subscribers' (interfaces) are orphaned and 'stuck'. The issue may be avoided by delaying the CoA requests by enough time to allow the authd to receive the acct-start responses for login. PR1112323
When multiple authentication or accounting Radius servers are configured and if one of the servers is down/not-reachable, the Access-Request messages will be queued to the next Radius server no matter its "max-outstanding-requests" is reached or not. In case that all the Radius servers reached its "max-outstanding-requests", the new requests should be queued to an internal queue but they are queued to the last Radius server. As a workaround, we can use only one Radius server or make sure all the Radius servers are reachable. PR1122703
In subscriber management environment, the authentication process (authd) crash may occur. This issue is not reproduced yet, possibly, it might be seen when generating a CLI Change of Authorization (CoA) request (e.g., via CLI command "request network-access aaa subscriber add service-profile filter-service session-id 10"), then logging out the subscriber (the one with service just activated), if the management CLI session is closed before subscriber entry is reused, the crash may occur. PR1127362
In subscriber management environment with AAA authentication, after a few rounds of login/logout, some dynamic PPPoE subscribers might stuck in configured (AuthClntLogoutRespWait) state. PR1127823
On MX Series platforms, with "subscriber-management" enabled, the authd process might crash during subscribers concurrent login/logout. When authd process crash, the new subscribers might not login. But all connected subscribers remain connected. The authd process will restore in a short time, then new subscribers could login successfully. PR1128622
For Point-to-Point Protocol over Ethernet over ATM (PPPoEoA) subscribers, during subscriber bringing down, the assigned IFL unit number is not correctly retrieved, so it can cause premature unit number exhaustion and thus fails to resolve &junos-interface-unit/ &junos-interface-name variables. PR1137723
When class attribute is changed for a subscriber via COA, existing subscriber services continue to use the class attribute value at the time when that service was created. Updated class attribute value will take effect for the subscriber and the services created there. When both service and class attributes are present in COA request, AUTHD first processes the service requests and then processes class attribute. Due to this, accounting starts for requested services does not contain updated class attribute. PR1143083
In normal BRAS environment, if the radius queue is presently full, MX BRAS might stop send accounting messages and customer might see "Radius result is CLIENT_REQ_MAXED_OUT" in authd log messages. PR1152052
User Interface and Configuration
Junoscript traceoptions are available. PR1062421
When committing a configuration with very long as-path, in this case the as-path is almost 12000 characters long, the commitd process might crash. The commitd process restart results in a minimal impact of system. As a workaround, please configure as-path less than 4096 characters long. PR1119529
While using wildcard with interface like "set groups <group name> interfaces <xe> unit <unit>", there is no "disable" option followed. PR1137377
When there are two or more sessions accessing the router, and one of the session (for example, session 1) is executing commit check in configuration private mode, if another session (for example, session 2) is keep executing commit and-quit in configuration private mode, because the commit check is not keeping the lock on local Routing Engine for entire session, there is a chance that session 2 will hit a Database opening error. The detailed sequence events are as following: (1) Session 1: commit check is not keeping the lock on local Routing Engine for entire session, once commit check on local is success, while it asked for lock on other Routing Engine. (2) Session 2: mgd acquired db lock on local Routing Engine. (3) Session 1: once commit check is completed on remote Routing Engine, it does cleanup and deleted the juniper.data+ (created by Session 2). (4) Session 2: juniper.data+ is still in use at local Routing Engine for by daemons and daemons start complaining about it and emitted the messages as "Database open failed for file '/var/run/db/juniper.data+' ". PR1141576
VPNs
In NG-MVPN network, if there is a device working as PE which uses PIM, GRES/NSR Routing Engine switchover might cause multicast traffic loss. PR1086129
In scenario involving pseudowire redundancy where CE facing interface in the backup neighbor (can be non-standby, standby, hot-standby type), if the virtual circuit (VC) is not present for the CE facing interface, the CE facing interface may go up after committing an unrelated VC interface configuration (e.g., changing description of another VC interface) even though the local pseudowire status is in down state. PR1101886
On dual Routing Engine platform with BGP L2VPN and NSR configured, there might be a chance that the block label allocation and deletion for L2VPN is out of order on backup Routing Engine as following: Master rpd follows the below sequeces (which is the correct order): Add Prefix P1 of Label L1 Delete Prefix1 of Label L1 Add Prefix P2 of Label L1 However, on backup rpd, it goes like this: Add Prefix P1 of Label L1 Add Prefix P2 of Label L1 <====== Delete Prefix1 of Label L1 In this situation, backup rpd cannot allocate the label L1 for P2 since L1 is already in use for P1, so it crashes. This occurs in scaling environment (10k L2VPN) where the router has multiple BGP peers and different L2VPN routing-instances are deleted and added back. PR1104723
In Global Table Multicast (GTM) scenario (instance-type mpls-internet-multicast), when the GTM instance and master instance are used, if the name of the GTM instance is changed, the routing protocol process (rpd) may crash due to the usage of the incorrect routing table handle. PR1113461
In L2circuit environment, if one PE has pseudowire-status-tlv configured but remote has not, and at the same time, this PE does not support control-word but remote does, then it will not send changed local status code to remote PE, in a rare condition, after enable status-tlv support at remote end, the l2circuit might stuck in "RD" state on remote PE. PR1125438
In next-generation multicast virtual private network (MVPN) scenario, the rpd process will crash on the PE router after receiving PIM join messages from local receivers if "nexthop-hold-time" is configured in this local VPN routing and forwarding (VRF). As a workaround, we can disable "nexthop-hold-time" to avoid this issue. PR1131346
Resolved Issues: 15.1R2
Class of Service (CoS)
For an ATM interface configured with hierarchical scheduling, when a traffic-control-profile attached at ifd (physical interface) level and another output traffic-control-profile at ifl (logical interface) level, flapping the interface might crash the FPC. PR1000952
In SNMP environment, when performing multiple walks or parallel snmpget for same interface at the same time (for example, SNMP bulk get/walk, or SNMP polling from multiple devices) on CoS related MIBs (jnxCos table), if the interface state changes or the request times out when FPC is responding the request, memory leak of Class-of-Service process (cosd) about 160 bytes (up to 1500 bytes) may occur, which may cause cosd to crash eventually when limit is exceeded. PR1058915
On MX Series platform, when aggregate Ethernet (AE) interface is in link aggregation group (LAG) Enhanced mode, after deactivating and then activating one child link of the LAG , the feature that runs on AE interface rather than on the child link (for example, IEEE-802.1ad rewrite rule) may fail to be executed. PR1080448
After restarting chassisd or doing an in-service software upgrade from 13.2R8.2 to 13.3R7.3, results in the following messages seen in syslog: cosd_remove_ae_ifl_from_snmp_db ae40.0 error 2 Messages appear to be harmless with no functionality impact. PR1093090
On MX104 platform, when we configure rate-limit for the logical tunnel (lt-) interface, the commit will fail. As a workaround, we can use firewall filter with policer to achieve the same function. PR1097078
On MX Series platforms, when class-of-service (CoS) adjustment control profiles and "overhead-accounting" are configured, if the ANCP adjust comes before the logical interface (logical interface) adding message and the logical interface is in "UP" state when added (for example, it may occur when carrying scaling subscribers, for instance, 8K subscribers). For some of the subscribers, the local shaping rate from dynamic profile for the subscriber logical interface may not be overridden by shaping-rate of ANCP. PR1098006
When performing the Routing Engine switchover without GRES enabled, due to the fact that the Class-of-Service process (cosd) may fail to delete the traffic control profile state attached to logical interface (IFL) index, the traffic-control-profile may not get programmed after the logical interface index is reused by another interface. PR1099618
Forwarding and Sampling
When there are no services configured, datapath-traced daemon is not running. In the PIC, the plugin continues to try for the connection and continuous connection failure logs are seen. PR1003714
In IP security (IPsec) VPN environment, after performing the Routing Engine switchover, the traffic may fail to be forwarded due to the SAs may not be downloaded to the PIC, or due to some security associations (SAs) on the PIC may incorrectly hold references for old Security Policy Database (SPD) handles while SPD has deleted its entries in the Security Association Database (SAD). PR1047827
On all Junos OS based platforms, there are two different types of memory blocks that might be leaked. The first issue is rpd-trace memory block leak. There is one block each for any trace files opened for rpd. They could be leaked for each time a configuration commit is done. Around 40 bytes are leaked per operation. The issue does not occur in Junos OS Release prior to 14.1. The second issue is rt_parse_memory block leak which could happen during the configuration of aggregate routes, configuration information might not be freed. Around 16384 bytes are leaked per operation. This issue is a day-1 issue. PR1052614
When enabling pseudowire subscribers the "show subscribers extensive" command does not display CoS policies applied to the subscriber interface. This issue was fixed in 13.3R6, 14.1R5 and 14.2R3. PR1060036
For MX Series Virtual Chassis (MX-VC) with scaled subscribers, for example, 100K DHCP/20K PPPoE subscribers. If the Virtual Chassis port (VCP) FPCs also house the uplink ports and the "indirect-next-hop-change-acknowledgements" and "krt-nexthop-ack-timeout" configuration statements are configured along with the protection mechanism, after the master Routing Engine in the Virtual Chassis master router (VC-Mm) is powered down, the traffic loss and subscriber loss might be observed due to the indirect next-hop change acknowledgement timeout. With this fix, the upper limit for "krt-nexthop-ack-timeout" is changed from 100 seconds to 250 seconds. PR1062662
For MX-VC platform, performing unified ISSU in scaled subscribers environment might cause all VC members to get restarted unexpectedly. PR1070542
After rebooting the BNG with scaled subscribers, a dynamic-profile add request might fail, causing bbe-smgd (subscriber management daemon) to crash, then some subscribers might fail to login. PR1071850
Juniper Networks device is not sending an error code to the Open vSwitch Database (OVSDB) client when the commit fails. Now a graceful mechanism is introduced to handle netconf configuration errors. If a netconf commit fails, the transaction will be routed to a failed queue. The transaction remains in the failed queue, until the user takes action to explicitly clear the transaction from the failed queue using the CLI. New CLI commands to show and clear failed netconf transactions. user@router> show ovsdb netconf transactions Txn ID Logical-switch Port VLAN ID 1 vlan100 user@router> clear ovsdb netconf transactions PR1072730
On MX Series-based platform, when the Layer 3 packets destine to an Integrated Routing and Bridging (IRB) interface and then hit the underlying Layer 2 logical interfaces (IFLs), due to the egress feature list of the Layer 2 logical interfaces may get skipped, the features under the family bridge (for example, the firewall filter) on the Layer 2 interfaces may not be executed. PR1073365
The issue is seen while moving an interface from one mesh group to another. PR1077432
In scaled subscriber management environment (for example, 3.2K PPPoE subscribers), after heavy login/logout, the session setup rate keeps decreasing and also PAP-NAK messages are sent with "unknown terminate code". This continues till Broadband Network Gateway (BNG) does not accept PPP sessions and all newly incoming sessions are stuck in PAP Authentication phase (No PAP ACK received). PR1075338
The license-check process may consume more CPU utilization. This is due to a few features trying to register with the license-check daemon which license-check would not be able to handle properly and results in high CPU on Routing Engine . Optimization is done through this fix, to handle the situation gracefully so that high CPU will not occur. PR1077976
From Junos 14.1R1, if the hidden configuration statement "layer-4 validity-check" is configured, the Layer4 hashing will be disabled for fragmented IP traffic. Due to a defect, the Multicast MAC rewrite is skipped in this case, the fragmented multicast packets will be sent with incorrect destination MAC. PR1079219
The rpd process might crash on both master and backup Routing Engines when a routing instance is deleted from configuration, if the routing instance is cleaned up before the interface delete is received from device control daemon (dcd). This is a rare timing issue. PR1083655
OTN based SNMP Traps such as jnxFruNotifOperStatus and jnxIfOtnNotificationOperStatus are raised by offline/online MIC although no OTN interface is provisioned. PR1084602
Invalid Ethernet Synchronization (ESMC) frames may be transmitted by MX router when activating LAG and tag-protocol-id under interfaces. PR1084606
On a device with lt and ams interfaces configured, walking ifOutOctets or other similiar OID's may cause a "if_pfe_ams_ifdstat" message to print. This is a cosmetic debug-level entry, which was incorrectly set to critical-level. PR1085926
In the specific configuration of a LT interface in a VPLS instance and the peer-unit of this LT interface configured with family inet6 using vrrp, the kernel may crash when the FPC is online. PR1087379
On MX Series based line card, if a rlsq interface is receiving continuous fragmented traffic, doing rlsq switchovers couple of times might cause FPC to crash and reboot. PR1088300
In rare cases, SSH or telnet traffic might hit incorrect filter related to SCU (Source Class Usage) due to the defect in kernel filter match. This issue comes when the filter has match condition on source class ID. PR1089382
In rare cases, MX Series routers might crash while committing inline sampling related configuration for INET6 Family only. PR1091435
In a fib-localization scenario, IPv4 addresses configured on service PICs (SP) will not appear on FIB-remote FPCs although all local (/32) addresses should, regardless of FIB localization role, install on all Packet Forwarding Engines. There is no workaround for this and it implies that traffic destined to this address will need to transit through FIB-local FPC. PR1092627
There are entries for PEM in jnxFruEntry in VMX. It is not necessary and is cosmetic. PR1094888
When route convergence occurred, the new gateway address is not updated correctly in inline-jflow route-record table (route-record table is used by sampling), and the sampling traffic forwarding might be affected, but normal routing would be not affected. PR1097408
After upgrading to Junos OS Release 14.1R1 and higher, loopback ISO family address may be stuck in KRT queue. PR1097778
When BGP multipath is enabled in a Virtual Routing and Forwarding (VRF), if "auto-export" and "rib-group" are configured to leak BGP routes from this Routing Instance table to another, for example, the default routing table, then traffic coming from the default routing instance might not be properly load balanced due to the multipath-route leaked into the default routing table is not the active route. This is a random issue. As a workaround, only use "auto-export" to exchange the routes among the routing tables. PR1099496
General Routing
There is hardware design flaw with 2x10GE MIC and 4x10GE MIC today which introduces +/-6.2ppm frequency offset for SyncE operation. In order to correct this, the framing of the PIC and interface has to be matched (which will not be by default). PR932659
SNMP MIB walk of object "jnxSpSvcSet" gives hardcoded value as "EXT-PKG" for SvcType. PR1017017
With Multiservices MPCs (MS-MPCs) or Multiservices MICs (MS-MICs) installed on MX Series platform, when trying to view the Network Address Translation (NAT) mappings for address pooling paired (APP) and/or Endpoint Independent Mapping (EIM) from a particular private or a public IP address, all the mappings will be displayed. PR1019739
On MX Series router with MPC3E/MPC4E/MPC5E/MPC6E if the Packet Forwarding Engine has inline NAT configured or is processing inline GRE decapsulation with packet-sizes between 100B-150B, in some very corner cases, traffic blackhole might be seen due to incorrect cell packing handling. On T4000 with FPC type 5, when these cards are processing any packets sizes between 133B-148B in certain sequences causes incorrect cell packing handling. PR1042742
Queue stats on LSQ interfaces are not properly cleaned up when queuing enabled on the Physical Interface and the queues hosted at Physical Interface level. This happens when a subsequent delete and create of LSQ interface (not always though) - 14.1R4.10. PR1044340
MPC with Channelized OC3/STM1 (Multi-Rate) Circuit Emulation MIC (MIC-3D-4COC3-1COC12-CE) might crash. This problem is very difficult to replicate and a preventive fix will be implemented to avoid the crash. PR1050007.
In subscriber management environment, the Berkeley Database (DB) may get into deadlock state. It is brought on by multiple processes attempting to simultaneously access or update the same subscriber or service record. In this case, due to the access to DB were blocked by device control daemon (dcd), the subscriber management infrastructure daemon (smid) fails to recover the DB. Consequently, the router may stop responding to all the login/logout request as well as statistics activity. This timing related issue is most likely to occur during login or logout and when the system is busy. PR1054292
With inline L2TP IP reassembly feature configured, the MX Series routers with MPCs/MICs might crash due to a memory allocation issue. PR1061929
In subscriber management environment, if IPv6 family is not enabled in the dynamic profile, the IPv6 Router Advertisement message will not be sent through the dynamic subscriber interface. As a workaround, you can enable family inet6 in the dynamic profile. PR1065662
When setting the syslog to debug level (any any), you may note reoccurring messages of the form "ifa for this rt ia is not present, consider ifa as ready". These messages are logged for IPv6 enabled interfaces when receiving forwarded packets and cause no harm. Set a higher debug level to avoid seeing them. PR1067484
The static route prefers the directly connected subnet route for resolving the nexthop rather than performing a longest prefix match with any other available routes. In case of longest prefix route being desired in customer deployment, it will result in traffic loss issue. Now a new configuration statement "longest-match" is introduced to enable longest prefix matching behavior when desired: set routing-options static route <destination prefix> next-hop <address> resolve longest-match. PR1068112
In subscriber management environment, changing the system time to the past (for example, over one day) may cause the processes (for example, pppoed, and autoconfd) that use the time to become unresponsive. PR1070939
Higher baseline CPU utilization and periodic CPU spikes might be seen on XM-based MPC as compared to MPC-3D-16XGE-SFPP Cards due to the following reasons: On MX-based MPC, low priority threads which monitor various things in the background on a periodic basis such as voltage, temperature, stats counters, hardware status exist. When the system is idle, these threads are allowed to take more of the load and that is why higher baseline CPU/CPU spikes are seen. This does not prevent other higher priority threads from running when they have to, as these are non-critical activities being done in the background and hence it is a non impacting issue. PR1071408
Traffic throughput test between MPC1/1E/2/2E card and MPC2E/3E NG card, the flowing from MPC1/1E/2/2E card to MPC2E/3E NG card is lesser then from MPC2E/3E NG card to MPC1/1E/2/2E card. PR1076009
Vendor provided the fix, which includes conditional check. PR1076369
In a Q-in-Q setup, if outer vlan tag is coming with EtherType 0x88a8, it is not possible to create dynamic vlan interface on Junos 13.1X42 or 14.1X51 releases. PR1080734
On MX Series platform with MS-MPC/MS-MIC, in some mspmand process crash scenarios, after the mspmand coredump is finished or almost finished, PIC kernel also crashes and dumps vmcore. The mspmand generates core files in these scenario are readable but vmcores are not. PR1081265
In DHCPv6 prefix delegation over PPPoE scenario, when forwarding the control packet from the Routing Engine to the DHCPv6 identity association for prefix delegation (IA_PD) address over PPPoE, for instance, executing ping from Routing Engine targeting the client's PD address, the traffic may get dropped on the device. PR1081579.
If a router has Service PIC equipped but without any Service PIC specific configurations, the CPU usage on this PIC/FPC might be high. Have some configurations under below configuration statement could prevent from this issue: [system processes process-monitor traceoptions] OR [chassis fpc <fpc slot> pic <pic slot> adaptive-services service-package extension-provider] OR [services] PR1081736
In multi-homing and signal active EVPN scenario, if IRB interface is included in the instance, when the DF-CE link flaps, due to a timing issue, the DF might send L3 EVPN routes with label 0 to remote PEs, causing traffic to be dropped at remote PE. PR1082287
'show interfaces queue <ifl>' stats are not correct with RLSQ warm-standby mode. Issue seen on MPCs and MICs as well in 14.1R4.10. PR1082417
TCP messages do not have their MSS adjusted by the Multiservices MIC and MPC if they do not belong to an established session. PR1084653
With a scaled subscribers system, repeatedly doing tcpdump of subscriber interface and press ctrl+c might cause bbe-smgd daemon memory growing, which will in turn causing crash, SDB corruption and some other daemons crashing. Following signs may be seen when this problem is hit: log messages like: "/kernel: cmd bbe-smgd pid 1997 tried to use non-present sched_yield" tcpdump stops working bbe-smgd no longer accepts new vty sessions. PR1085944
In some rare conditions, depending on the order in which configuration steps were performed or the order in which hardware modules were inserted or activated, if PTP master and PTP slave are configured on different MPCs on MX Series router acting as BC, it might happen that clock is not properly propagated between MPCs. This PR fixes this issue. PR1085994
MACsec using static secure association key (SAK) security mode does not work properly on MX80 routers and FPC slots other than slot 0 of MX104 routers. PR1086117
mspmand.core is observed while making ms-mic offline with IPsec and Jflow configured on same ms-mic with dynamic IPSEC tunnels. PR1086819
If the ALG is receiving UDP fragmented control traffic (e.g. SIP control packets) continuously, the mspmand process (which manages the service PIC) might crash due to buffer error. PR1087012
Due to a change in an existing PR, group names in the configuration must be a string of alphanumericals, dashes or underscores. There is not workaround other than following the group name instructions.PR1087051
On LAC (L2TP Access Concentrator) router with session client-idle-timeout configured, the tunneled PPP session will always keep active due to the PPP control messages are accounting as user data. PR1088062
Wrong ESH checksum computation with non-zero Ethernet Padding in Juniper MX Series router. PR1091396
The mspmand process might crash due to prolonged flow-control with TCP ALGs under the following possible scenario, mostly when the following conditions happen together: 1. When the system is overloaded with TCP ALG Traffic 2. There are lots of retransmissions and reordered packets. PR1092655
When the control path is busy/stuck for service PIC, the AMS member interface hoisted by it might be down, but when the busy/stuck condition is cleared, the member interface might not recover, and AMS bundle still shows the PIC as inactive. PR1093460
On TCP ALG, if there are a lot of retransmissions and reordered TCP packets, and the system is overloaded due to the TCP traffic, the mspmand (which manages the service PIC) process might crash. PR1093788
In a scaled Broadband Subscriber Management environment (in this case, 16K subscribers), when Access Node Control Protocol (ANCP) CoS adjustment is configured, the minimum rate instead of the shaping-rate might be wrongly applied to some subscribers and causes traffic loss. PR1094494
Extensive Header integrity checks will be done for packets which match a service set which has NAT/SFW configured. 1. Enable Header integrity checks by default when SFW or NAT is configured in same service set. This is inline with ukernel behavior 2. Retain the configuration statement for use by other plugins such as IPsec which may want to enforce header integrity if needed 3. Ensure that the cmd "show services service-sets statistics integrity-drops" works if sfw/nat is configured PR1095290
The issue is because of the software problem. Just after the system reboots, rpd process is determining the Routing Engine mastership mode too early before chassisd is determining the mastership , which would cause overload feature to not work properly. PR1096073
If a service-PIC is configured to simultaneously function as both an MS interface and as a member of an AMS interface, then some settings under services-options may not apply correctly. These settings are A) syslog_rate_limit, B) fragment-limit, C) reassembly-timeout and D) jflow_log_rate_limit. PR1096368
For Junos 13.3R1 and later, the DPC card might experience a performance degradation when it's transferring bidirectional short packets (64B) in inline rate. PR1098357
Some of the new revisions (for example, REV 30, REV 31) of the MICs can not come up with NG-MPC2 or NG-MPC3 line card. We can check the MIC version by CLI command "show chassis hardware detail | no-more". root@user> show chassis hardware detail | no-more Hardware inventory: Item Version Part number Serial number Description .. FPC 2 REV 14 750-054901 CADJ3871 MPC3E NG PQ & Flex Q CPU REV 11 711-045719 CADN5465 RMPC PMB MIC 0 REV 30 750-028392 CAEB9203 3D 20x 1GE(LAN) SFP <<<<<<REV>PR1100073
When the null pointer of jbuf is accessed (jbuf, that is, a message buffer is allocated only when the packet is ready to process. The buffer is freed after the packet completes ALG handling is accessed), for example, when using the Microsoft Remote Procedure Call (MS RPC) (as observed, issue may also happen on Sun Microsystems RPC) Application-level gateway (ALG) with NAT (stateful firewall is used as a part of the service chain), if the traffic matching configured universal unique identifier (UUID) is arrived on the ALG, the mspmand (which manages the Multiservice PIC) crash occurs. PR1100821
In broadband edge (BBE) environments, for example, if the interface-set is created corresponding to SVLAN, then multiple logouts and logins will create a new interface-set index. When the interface-set index range goes above 65535, executing CLI command "show interfaces interface-set queue egress" will cause 100% CPU usage. As a workaround, we can use the specified interface-set name instead of using the wildcard. PR1101648
On MX dual Routing Engine platforms, if there are a large number of addresses (in this case, there are > 500 addresses configured, the issue might be observed around 472 addresses) configured on lo0.0, when the Broadband Edge subscriber management daemon (bbe-smgd) replicating these addresses to the standby Routing Engine, the internal 8K replication buffer may get exceeded. Due to this failure, memory leak (around 45MB every time error is encountered) may occur when bbe-smgd tries to delete the object. Since lo0.0 object gets created/destroyed over and over, bbe-smgd runs out of memory and crash eventually. PR1101652
Non-queuing MPC5E might crash continuously if rate-limit under transmit-rate for scheduler is applied. As a workaround, do not configure rate-limit and use firewall policer for forwarding-class instead. MPC5EQ is not exposed. PR1104495
On MX Series platform, the output of CLI command "show system subscriber-management route" may be shown as empty. PR1104808
An IPv4 filter configured to use the filter block with term that has both "from precedence" and another non 5-tuple (i.e. not port, protocol, address) will cause an XL/EA based board to reboot. Example: set firewall family inet filter FILTER fast-filter-lookup set firewall family inet filter FILTER term TERM from precedence PRECEDENCE set firewall family inet filter FILTER term TERM from tcp-established. PR1112047
High Availability (HA) and Resiliency
On dual Routing Engine platforms with NSR enabled, when committing scaling configuration (for example, deactivating 500 logical interfaces and performing commit, then activating 500 logical interfaces and commit, the process may need to be performed 3-6 times) to the device, the master Routing Engine would be busy processing commit, due to which the backup does not get data or keepalive from master. In this situation, the protocols (for example, OSPF, or LDP) may get down on the backup Routing Engine due to keepalive timeout. PR1078255
Interfaces and Chassis
Chap Local-name default to 8 characters. Should be 32. PR996760.
On MX Series platform with large-scale PPPoE subscribers (more than 60k) connected, PPP client process (jpppd) might crash and generate core files when performing RE switchover.PR 1018313
If a subscribers-facing AE interface has link protection enabled, offline the primary child link hosted FPC might cause some subscribers to down. PR1050565
dcd will crash if targeted-distribution applied to ge ifd via dynamic-profile. PR1054145
There is a mismatch in mac statistics, few frames go unaccounted. This is a day-1 issue with the software fetching of mac statistics, the snap and clear bits were setting set together on pm3393 chip driver software, so it used to so happen that even before the copy of stats to shadow registers happened, clear was happening which used to go unaccounted. PR1056232
When a dynamic PPPoE subscriber with targeted-distribution configured on a dynamic vlan demux interface over aggregated ethernet, the device control daemon (dcd) process might crash during a commit if the vlan demux has mistakenly been removed. The end users cannot visit internet after the crash. This is a rare issue and not easy to be reproduced. PR1056675
It is observed that the syslog messages related to kernel and Packet Forwarding Engine may get generated at an excessive rate, especially in subscriber management environment. Most of these messages may appear repeatedly, for example, more than 1.5 million messages may get recorded in 2 hours, and there are only 140 unique messages. Besides, these messages are worthless during normal operation and due to the excessive rate of log generation, it results in high Routing Engine CPU consumption (for example, Routing Engine CPU utilization can be stuck at 100% for a long time (minutes or hours), it depends on the activity of subscribers (frequency of logins and logouts) and on the AI scripts used by the customer) by event process (eventd) might be observed on the device. PR1056680
When "set chassis lcc 0 offline" is used on SCC and committed, the configuration gets synced on LCC. However when "delete chassis lcc 0 offline" is used on SCC, we need to do commit two times on SCC in order to sync the config on LCC being brought online. PR1058994
For transit traffic on INLINE LSQ redundancy (rlsq) interface, the input firewall-filter counters are logging zero packet count regardless of traffic flow. Output filter counters are logging correctly. For host-bound traffic, the firewall output counter will get double accounted on Classical rlsq and triple accounted on INLINE rlsq. This issue is targeted to be fixed in Junos 14.1R5. PR1060659
In scaling PPP subscriber environment, when the device is under a high load condition (for example, high CPU utilization with 90% and above), the long delay in session timeout may occur. In this situation, the device may fail to terminate the subscriber session (PPP or PPPoE) immediately after three Link Control Protocol (LCP) keepalive packets are missed. As a result, the subscriber fails in reconnect due to old PPP session and corresponding Access-Internal route are still active for some time. In addition to this, it is observed that the server is still sending KA packets after the session has timed out. PR1060704
For Junos OS Release 13.3R1 or above, after multiple (e.g. 26) iterations of graceful Routing Engine switchover (GRES), the TNP address of management interface might be deleted incorrectly during switchover, this leads to all FPCs being offline. PR1060764
Link Up/Down SNMP traps for AE member links might not be generated, but the SNMP traps for the AE bundle works well. PR1067011
In PPP-based subscriber management environment, after performing scaling subscribers login/logout, the subscribers might be stuck in terminating and terminated state because logout requests are not processed properly, and the Session Database (SDB) might get exhausted eventually after stuck subscribers exceeding 256000.PR 1073146
In PPP subscriber management environment, the jpppd process might crash for a timing issue. PR1074545
When the Ethernet Link Fault Management (LFM) action profile is configured, if there are some errors (refer to the configuration, for example, frame errors or symbol errors) happening in the past (even a long past), due to the improper handling of error stats fetching from kernel, the LFM process (lfmd) may generate false event PDUs and send false alarm to the peer device. PR1077778
On MX Series Virtual Chassis (MX-VC) platform, due to a timing issue, the physical interface (ifd) on the same Modular Interface Card (MIC) with Virtual Chassis port (VCP) might not be created or takes a very long time to be created after rebooting the hosted Modular Port Concentrator (MPC). PR1080032
MAX-ACCESS value has been changed in jnx-otn.mib for the following oids: jnxOtnIntervalOdu15minIntervalNumber jnxOtnIntervalOtu15minIntervalNumber jnxOtnIntervalOtuFec15minIntervalNumber The value has been changed from read-only to not-accessible to be inline with newer MIBs. PR1080802
On MX Series platform acting as broadband network gateway (BNG), in Point-to-Point Protocol (PPP) scenario, when using the Internet Protocol version 6 Control Protocol (IPv6CP) for negotiation, if the router receives an IPv6CP Configure-Request packet from client, MX BNG sends the Configure-Request packet, but does not send IPv6CP Configure-Ack packet, in case it does not receive the Configure-Ack that responding to the Configure-Request packet it sent. The behavior does not follow the RFC 1661, which demands both the actions Send-Configure-Request (i.e. IPv6CP-ConfReq from MX to client) and Send-Configure-Ack (i.e. IPv6CP-ConfAck from MX to client) to be conducted on the router without any significant delay. PR1081636
With Non-MX Series/service DPCs which are not supported with enhanced-ip, when these unsupported DPCs are in the chassis, the user switches to enhanced-ip and reboots the router, the router should come back up and the unsupported DPCs should stay powered off and not log any alarms. In this case, the non-supported DPCs stay powered off, but they are also continuing to raise alarms. There are two workarounds for this issue; first, power down the FPC prior to changing enhanced-ip mode; second, perform a hard restart by "restart chassis-control immediately" to restore. Both of these workarounds will impact traffic through the router. PR1082851
In MX virtual chassis (MXVC) scenario, during unified ISSU operation, the new master Routing Engine does not have the MXVC SCC's system MAC address. It just has its local system MAC address. The address is not replicated between local Routing Engines, and the new master Routing Engine is not yet connected to the MXVC SCC to receive it. Hence, the possibility of overwriting the FPC with an address that does not match the previous address exists. PR1084561
The VRRP preempt hold time is not being honored during NTP time sync and system time is changed. PR1086230
On MX Series Virtual Chassis (MX-VC) platform with "subscriber-management" enabled, after power up/reboot, the VC backup router (VC-B) experiences a rapid sequence of role transitions from no-role to VC master router (VC-M) to VC-B, the expected local GRES and a reboot of the former master Routing Engine might not happen on the VC-B. Some of the FPCs on it might be stuck in "present" state and eventually rebooted. PR1086316
Deactivating/activating logical interfaces may cause BGP session flapping when BGP is using VRRP VIP as the source address. This is caused by a timing issue between dcd and VRRP overlay file. When dcd reads the overlay file, it is not the updated one or yet to be updated. This results in error and dcd stops parsing VRRP overlay file. PR1089576
In the dual Routing Engines scenario with GRES and ae0 interfaces configuration, if GRES is disabled on system, the backup Routing Engine should remove the ae0 bundle, however it does not go clean and ae0 remains in backup Routing Engine. After switching Routing Engine mastership to make other Routing Engine as master, the new master Routing Engine (which was backup earlier) continues to use invalid MAC address "00:00:00:00:00:00". PR1089946
When an interface on SFPP module in MIC is set disabled, after pulling out the SFPP and then insert it, the remote direct connected interface might get up unexpectedly. PR1090285.
After removing a child link from AE bundle, in the output of "show interface <AE> detail", the packets count on the remaining child link spikes, then if add back the previous child link, the count recover to normal. PR1091425
For Junos OS version 14.1X51-D60 or 14.1X50-D105, when DHCP local server is configured, the DHCP subscribers might be unable to come up. PR1092553
In MX Series Virtual Chassis (MXVC) environment, when rebooting the system or the line cards which contain all the Virtual Chassis port (VCP) links, because line cards might fail to complete the rebooting process within 5 minutes, the timer (that is, the amount of time allowed for the LCC to connect to the SCC) started by the master router might expire which may cause the VCP links establishment failure. In addition, this issue is not specific to the line cards type, based on the observation, the timer (5 minutes) may expire on a MX2020 with all 20 FPCs equipped as well. PR1095563
On PB-2OC12-ATM2-SMIR PIC, port 0 and port 1 are configured with clock source as external, if Loss of signal (LOS) is inserted on port 0, the port 0 will go down, the expected behavior is clock being used from port 1. But in this case, port 0 down will results in port 1 flapping and reporting SONET phase lock loop (PLL) errors. PR1098540
In VRRP environment, with VRRP configured over double tagged interface and VRRP delegate-processing enabled, the PDUs are generated with only one tag and the outer tag is not added, because of which, the PDUs will get dropped at the receiving end. The similar configuration that may cause the issue might be seen as below, .. protocols { vrrp { delegate-processing; <<<<< "delegate-processing" is enabled for VRRP } .. .. interfaces { xe-0/0/3 { flexible-vlan-tagging; unit 0 { vlan-tags outer 2000 inner 200; <<<<< VRRP is configured over double tagged interface family inet { address 10.10.10.147/29 { vrrp-group 17 { virtual-address 10.10.10.145; priority 100; accept-data; } } } } } } .. PR1100383
After configuring related ae interface configuration, we might find some of ae interfaces disappear in MX-VC. It seemed that ae interfaces are not allocated MAC address from chassisd properly. * This issue only happens in the first configuration timing after rebooting/restarting chassisd. So even if you configure related ae interface configuration repeatedly, you cannot find this issue. When this issue happens these message will be seen in the messages logs. ------------------------------------------------- lab@router_re0> show log messages| match CHASSISD_MAC_ADDRESS_AE_ERROR Jun 26 16:04:34.064 router_re0 scchassisd[2008]: CHASSISD_MAC_ADDRESS_AE_ERROR: chassisd MAC address allocation error for ae4 Jun 26 16:04:34.105 router_re0 /kernel: Jun 26 16:04:34.064 router_re0 scchassisd[2008]: CHASSISD_MAC_ADDRESS_AE_ERROR: chassisd MAC address allocation error for ae4 ------------------------------------------------- Restore ae interfaces * This is not workaround. deactivate/activate ae interfaces. (We need to do this to all disappeared ae interfaces.) PR1100731
To ensure that the router or switch is reachable for management purposes while it boots or if the routing protocol process fails to start properly, we can configure a backup router, which is a router that is directly connected to the local router or switch (that is, on the same subnet) through its private management interface (for example, fxp0 or me0). When a backup router running IPv6 and a static route to reach the management network are configured, some invalid IPv6 routes are added to default forwarding-table on the master or the backup Routing Engine. PR1100981
Due to the fact that the error injection rate configured by user on Routing Engine via CLI command "bert-error-rate" may not be programmed in the hardware register, the PE-4CHOC3-CE-SFP, PB-4CHOC3-CE-SFP, MIC-3D-4COC3-1COC12-CE, and MIC-4COC3-1COC12-CE-H may fail to inject bit errors during a Bit Error Ratio Test (BERT). PR1102630
On MX240 or MX480 platform with at least two DC modules (PN: 740-027736) equipped, when shutting down one of the PEMs and then turn it on again, even the PEM is functioning, the "PEM Fan Fail" alarm might be observed on the device due to software logic bug. There is no way to clear the ALARM_REASON_PS_FAN_FAIL for I2C_ID_ENH_CALYPSO_DC_PEM once it has been raised. PR1106998
Layer 2 Features
Under rare circumstances it is possible for the DHCP drop counts for reason SEND ERROR to be incremented twice for a single failure. PR1009296
MTU change is not advised on the Ethernet ring protection (ERP) ring interfaces unless ring is in idle condition. Changing ring interface MTU while ring is not in idle state might result in change in the forwarding state of the interface which can lead to loop in the ring. PR1083889
When family bridge was configured and committed, l2ald repeated restarting with core. After l2ald repeated restarting several times, it stopped working due to thrashing condition. Core of l2ald will be seen with the following configuration. set interfaces fxp0 unit 0 family bridge interface-mode access set interfaces fxp0 unit 0 family bridge vlan-id 100 When the configuration is committed, message like following is logged and core is generated. l2ald[1624]: ../../../../../src/junos/usr.sbin/l2ald/l2ald_vpls_flood.c:3117: insist '!err' failed l2ald[1734]: ../../../../../src/junos/usr.sbin/l2ald/l2ald_vpls_flood.c:3117: insist '!err' failed l2ald[1769]: ../../../../../src/junos/usr.sbin/l2ald/l2ald_vpls_flood.c:3117: insist '!err' failed l2ald[1993]: ../../../../../src/junos/usr.sbin/l2ald/l2ald_vpls_flood.c:3117: insist '!err' failed l2ald[2195]: ../../../../../src/junos/usr.sbin/l2ald/l2ald_vpls_flood.c:3117: insist '!err' failed ... init: l2-learning is thrashing, not restarted PR1089358
During interface flaps, a high amount of TCN (Topology Change Notification) might get propagated causing other switches to get behind due to high amount of TCN flooding. This problem is visible after the change done from Junos OS Release 11.4R8 and later, which propagates TCN BPDU immediately and not in the pace of the 2 second BPDU. Hello interval to speed up topology change propagation. The root cause is that the TCNWHILE timer of 4 seconds is always reset upon receiving TCN notifications causing the high churn TCN propagation. PR1089580
In MX Series Virtual Chassis (MXVC) environment, when packets come from a interface (for example, xe-16/0/1.542) situated on one member of VC (for example, VC member 1), if the ingress Packet Forwarding Engine (for example,FPC16 PFE0,who runs hash to determine which interface it should send the packet to) decides that it should send the packet via another interface (for example, xe-4/0/1.670) situated on different member (for example, VC member 0), it will send the frame to member 0 via the vcp- intf. In case of xe-4/0/1.670 belongs to an AE bundle which has multiple child links, a hash need to be run on Packet Forwarding Engine carrying the VCP port (receiving side on member 0) to determine which one is the egress Packet Forwarding Engine within member 0 to send the packet out after vcp- intf gets the packet. This hash result should get the same result as the ingress Packet Forwarding Engine. If it is not the case, then the packet would get dropped on Packet Forwarding Engine on member 0. PR1097973
With scaled subscribers connected, restarting one of MPCs might cause subscribers unable to log in for about 2 minutes. PR1099237
MPLS
In Resource Reservation Protocol (RSVP) environment, if CoS-Based Forwarding (CBF) for per LSP (that filter out traffic not related to that LSP) is configured, and either the feature fast-reroute or link-protection is used on the device, when the primary link is down (for example, turning off the laser of the link), due to some next hops of the traffic may be deleted or reassigned to different class of traffic, and the RSVP local repair might fail to process more than 200 LSPs at one time, the traffic might get dropped by the filter on the device before the new next hop is installed. In this situation, the feature (fast reroute or link protection) may take longer time (for example, 1.5 seconds) to function and the traffic loss might be seen at the meantime. In addition, the issue may not be seen if the CBF for per LSP is not configured on the device. PR1048109
In race conditions, the rpd process on backup Routing Engine might crash when BGP routes are exported into LDP by egress-policy and configuration changes during the rpd process synchronizing the state to backup rpd process. PR1077804
On dual Routing Engine platform with GRES , the kernel synchronization process (ksyncd) might crash on the backup Routing Engine when adding of route pointing to indirect nexthop on system. PR1102724
Network Management and Monitoring
In rare cases, when the mib2d process attempts connection with the snmpd process and there are pending requests waiting to be finished, the mib2d process might crash and the CPU utilization is high around the same time as the crash happens. PR1076643
Due to a bug in jnxIfFcInline mib, a high order interface churn such as the one done by the submitter in this case, can lead to a mib2d core. The situation is recovered after the core and no other impact is seen. PR1105438
Platform and Infrastructure
Adaptive load-balance functionality is only supported for unicast traffic. If the aggregate bundle contains logical interfaces for bridge or vpls domains, flooded traffic might get dropped. PR821237
In dual Routing Engines scenario with NSR configuration, the configuration statement "groups re0 interfaces fxp0 unit 0" is configured. If disable interface fxp0, backup Routing Engine is unable to proceed with commit processing due to SIGHUP not received, the rpd process on backup Routing Engine might crash. PR974430
When Network Configuration Protocol (NETCONF) service is used on the device, after the NETCONF session is established, because all the output that contain <error> tag might be incorrectly converted into <rpc error>, the management daemon (mgd) may crash on the device. As the following example, the output that contains <error> tag may lead to the crash. user@re0> show subscribers address 1000 | display xml .. <error junos:style="input-error"> <<<<<< The output contain <error> tag and may trigger the crash. PR975284
On MX Series Virtual Chassis (MX-VC) platform, mirroring of OAM packets may not work as expected if the OAM packet is traversing through multiple Packet Forwarding Engines (for example, the mirrored port and VCP port are on separate Packet Forwarding Engines). PR1012542
In EVPN scenario, MPC may crash with core-file when any interface is deleted and add that interface to an aggregated Ethernet bundle or changing the ESI mode from all-active to single-active. PR1018957
LSI logical interface input packet and byte stats are also added to core logical interface stats, but when the LSI logical interface goes down and the core logical interface stats are polled, there is a dip in stats. The fix is to restore LSI logical interface stats to core logical interface before deleting the LSI logical interface. PR1020175
Under very rare situations, Packet Forwarding Engines on the following linecards, as well as the compact MX80/40/10/5 series, may stop forwarding transit traffic: - 16x10GE MPC - MPC1, MPC2. This occurs due to a software defect that slowly leaks the resources necessary for packet forwarding. Interfaces handled by the Packet Forwarding Engine under duress may exhibit incrementing 'Resource errors' in consecutive output of 'show interfaces extensive' output. A Packet Forwarding Engine reboot via the associated linecard or chassis reload is required to correct the condition. PR1058197
On MX Series router with frame-relay (FR) CCC to connect FR passport devices. If some of the FR circuits carry traffic without any valid FR encapsulations, the MX Series based Packet Forwarding Engine drops those frames. PR1059992
If a Radius server is configured as accounting server, when it is non-reachable, the auditd process might be stressed with huge number of audit logs to be sent to the accounting server, which might cause auditd to crash. PR1062016
Modifying IEEE-802.1ad rewrite-rule on the fly might be unable to change IEEE-802.1p ToS values for inner VLAN in QinQ. PR1062817
In Junos release 13.3R6 or 14.2R3, for PPPoE subscribers over the aggregated Ethernet (ae) interface, the output of "show interface statistics <pp> detail" command shows the ingress/egress traffic statistics for the aggregate interface instead of the statistics for PP/DEMUX logical interface. PR1069242
Having "shared-bandwidth-policer" on an aggregated ethernet interface; if a member interface flapped, the NPC which the interface belongs may restart. Similar issue may also happen when changing the firewall policer configuration. PR1069763
When Integrated routing and bridging (IRB) interface is configured with Virtual Router Redundancy Protocol (VRRP) in Layer 2 VPLS/bridge-domain, in corner cases after interface flapping, MAC filter ff:ff:ff:ff:ff:ff is cleared from the Packet Forwarding Engine hardware MAC table, so the IRB interface may drop all packets with destinations MAC address FFFF:FFFF:FFFF (e.g. ARP packet). PR1073536
It tries to check allotted power for all the FPCs, here in the CHASSISD_I2CS_READBACK_ERROR logs it shows for the FPCs which are not present in chassis. It just calls i2cs_readback() to read i2c device and fails there as these FPCs? slots are blank and prints those readback errors. Also the errors are harmless: "CHASSISD_I2CS_READBACK_ERROR: Readback error from I2C slave for FPC" Fix: Code to check 'if power has been allotted to this FPC', needs to be executed only if the FPC is present. PR1075643
When using the "ping detail" command, the interface number is provided on the output instead of the interface name. PR1078300
During a unified in-service software upgrade (ISSU), DHCP control traffic (renew/rebinds) might be dropped on ingress Packet Forwarding Engine. PR1079812
When an MX chassis network-services is "enhanced-ip" and an AE is part of a Layer 2 bridge (bridge-domain or VPLS), there is a possibility that an incorrect forwarding path might be installed causing traffic loss. This could happen when first applying the configuration, restarting the system or restarting the line card. PR1081999
On MX Series-based platform, the "RPF-loose-mode-discard" feature is not working when configured within a Virtual Router routing instance. The feature is working only when configured in the main instance. PR1084715
With MSDPC equipped on BNG, there might be a memory leak in ukernel, which eventually causes MSDPC to crash and restart. PR1085023
In Junos OS Releases 13.3R3, 14.1R1, 14.2R1, there is a new feature, an extra TLV term is added to accommodate the default action for the "next-interface" when the corresponding next-interface is down. While doing a unified ISSU from an image without the feature to an image with this feature, all MPCs might crash. PR1085357
If there are scaling unicast routes (e.g. 500k) in NG-MVPN VRF, and the provider-tunnel is PIM, when PIM on PE has multiple upstream neighbors and any of them could be its rpf neighbor, performing GRES/NSR Routing Engine switchover might cause multicast traffic loss due to the different view of rpf neighbor between the master Routing Engine and the slave Routing Engine. PR1087795
The prompt for SSH password changed in Junos OS Release 13.3, from "user@host's password:" to "Password:". This change breaks the logic in "JUNOS/Access/ssh.pm" which is located in /usr/local/share/perl/5.18.2/ on Ubuntu Linux, for example. PR1088033
On MX Series router with MPC1/1E, MPC2/2E line cards in a broadband edge environment with scaled (in this case 250K) subscribers, the FPC heap (dynamic memory) utilization increases significantly during an in-service software upgrade (ISSU). PR1088427
On MX Series platform with MPC/MIC or T4000 FPC5, TCP session with MS-Interface/AMS-Interface, configuration is not established successfully with the "no-destination-port" or "no-source-port" configuration statements configured under forwarding-options hierarchy level. PR1088501
Issue is specific to 64-Bit RPD and config-groups wildcard configuration specific as in the following case: set groups TEST routing-instances <*> routing-options multicast forwarding-cache family inet threshold suppress 200 set routing-instances vrf1 apply-groups TEST set routing-instances vrf1 routing-options multicast forwarding-cache family inet threshold suppress 600 With this daemon(rpd) reads suppressed value ?200? (i.e. coming from groups) instead of reading value ?600?from foreground and customer sees unexpected behavior with respect to threshold-suppress. Workaround: They can replace wildcard with actual routing-instance name as in below example: set groups TEST routing-instances vrf1 routing-options multicast forwarding-cache family inet threshold suppress 200 set routing-instances vrf1 apply-groups TEST set routing-instances vrf1 routing-options multicast forwarding-cache family inet threshold suppress 600 PR1089994
On MX Series router, if ifl (logical interface) is configured with VID of 0 and parent ifd (physical interface) with native-vlan-id of 0, when sending L2 traffic received on the ifl to Routing Engine, the VID 0 will not imposed, causing the frames to get dropped at Routing Engine. PR1090718
When an interface on MQ-based FPC is going to link down state, in-flight packet on interface transmit path will be stuck on the interface and never drained until the interface comes up again. As a result, small number of such stacked packets will be sent out when the interface is going to UP state. No other major impact should be seen after those packets are drained. PR1093569
On MX2020/2010 router, an SPMB core file will be seen if there are bad XF chips (fabric chip) on SFB, which might trigger Routing Engine/CB switchover. PR1096455
In 64-bit Junos OS environment, the Representational State Transfer (REST) API fails to start when configured with "set system services rest ...". PR1097266
When a P2MP LSP is added or deleted at ingress LSR, traffic loss is seen to existing sub-LSP(s) at transit LSR which replicates and forwards packet to egress PEs. This issue only affects MX Series based line card. PR1097806
The "shared-bandwidth-policer" configuration statement is used to enable configuration of interface-specific policers applied on an aggregated Ethernet bundle to match the effective bandwidth and burst-size to user-configured values. But this feature is broken from Junos OS Release 14.1R1 when "enhanced-ip" is configured on MX Series platform with pure MX Series-based line cards. The bandwidth/burst-size of policers attached to Aggregated Ethernet interfaces are not dynamically updated upon member link adding or deletion. PR1098486
On MX Series-based platform, when the type of the IPv6 traffic is non-TCP or non-UDP (for example, next header field is GRE or No Next Header for IPv6), if the traffic rate is high (for instance, higher than 3.5Mpps), the packet re-ordering may occur. PR1098776
On MX Series-based line cards, when the prefix-length is modified from higher value to lower value for an existing prefix-action, heap gets corrupted. Due to this corruption, the FPC might crash anytime when further configurations are added/deleted. The following operations might be considered as a workaround: Step 1. Delete the existing prefix-action and commit Step 2. Then re-create the prefix-action with newer prefix-length. PR1098870
In an MPLS L3VPN network with a dual-homed CE router connected to different PE routers, a protection path should be configured between the CE router and an alternate PE router to protect the best path. When BFD is enabled on the BGP session between the CE and the primary PE router, with local traffic flowing from another CE connected with the primary PE to this CE, after bringing the interface down on the best path, the local repair will be triggered by BFD session down, but it might fail due to a timing issue. This will cause slow converge and unexpected traffic drop. PR1098961
When the BFD is running on multi LU (lookup chip) Packet Forwarding Engine (such as MPC3 or MPC4), incoming BFD packet might be processed with a firewall filter on different logical-routers's loopback interface. If the firewall filter is discarding/rejecting BFD, the packets will be dropped incorrectly. PR1099608
On MX Series-based platform, before creating a new unilist nexthop, there is a check to see if there is at least 512k DoubleWords (DW) free. So, even the attempting NH requires only a small amount of memory (for example, < 100 DWs), if there is no such enough free DWs (that is, 512k), the check will fail and the end result is that the control plane will quit adding this NH prematurely - stopping at ~80% of capacity. With the fix, it will check for 64k free DWs which is lower reference watermark for available resource, thereby ensuring that can allocate resource. PR1099753
From Junos OS Release 14.1 and above, IPv6 mobility packets with Heartbeat option that the length of the mobility header (including the ethernet encapsulation and main IPv6 header) extends beyond 128 Bytes will be discarded as bad IPv6 option packet due to a logic error in packet handling. PR1100442
On MX Series platform, when using the 64-bit image, if the configuration statement "source-address" is configured for the "radius-server" as the following, the RADIUS request may not be sent to RADIUS server due to the failure of setting the "source-address" on the device. user@re0> show configuration system radius-server .. source-address 10.1.1.1; <<<<< The configuration statement that may cause the issue PR1103517
On T4000 platform with FPC Type-5 equipped, after performing unified ISSU, due to the fact that only 6 out of 16 temperature sensors may get initialized, the temperature reading for the line card may be shown as "Absent". PR1104240
Due to a software defect found in 13.3R7.3 and 14.1R5.4 , Juniper Networks strongly discourage the use of Junos OS software version 13.3R7.3 on routers with MQ-based MPC. This includes MX Series with MPC1, MPC2; all mid-range MX Series. PR1108826
Routing Policy and Firewall Filters
In Class-of-Service (CoS) environment, there is a possibility (happened twice so far and not reproducible in the lab) that routing protocol process (rpd) may crash because the CoS memory may get incorrectly freed and then allocated again. PR1062616
On the platform that M7i/M10i with enhanced CFEB, M320 with E3-FPC, M120, and MX with DPC, when the flood filter is configured in VPLS instance on the Packet Forwarding Engine, if the Packet Forwarding Engine receives a filter change (for example, FPC reboot occur and comes up), the line card may fail to program the filter. PR1099257
Routing Protocols
Support for the Pragmatic General Multicast protocol (daemon pgmd) is being phased out from Junos OS. In Junos OS Release 14.2, the CLI is now hidden (although the component is still there and configurable). In Junos OS Release 15.1 the code and its corresponding CLI are removed. PR936723
In PIM multicast-only fast reroute (MoFRR) environment, when issuing CLI command "show multicast route extensive" on egress edge router, due to missing null check while showing label information for reverse-path forwarding (RPF) nexthop, an error might be seen in the output of the command. In addition, the routing protocol process (rpd) may crash on the device. PR983140.
For the pim nbr which is not directly connected ( that is, nbr on unnumbered interface, or p2p interface with different subnet), pim join is not able to find the correct upstream nbr which results in join not propagating to the upstream nbr . show command for pim join shows upstream nbr "unknown" . Issue is present in the 15.1R1 release. PR1069896
In mutli-topologies IS-IS scenario, there is huge difference between estimated free bytes and actual free bytes when generating LSP with IPv6 Prefix. It might cause LSP fragment exhaustion. PR1074891.
There are two issues in the PR: (1) In multicast environment, Incoming interface list (IIF) list has only RPF interface, designated forwarder (DF) winners are not added in the list in backup Routing Engine. (2) "Number of downstream interfaces" in show pim join extensive is not accounting Pseudo-VXLAN interface. PR1082362
If a policy statement referred to a routing-table, but the corresponding routing instance is not fully configured (ie. no instance-type), commit such configuration might cause the rpd process to crash. PR1083257.
With Multicast Source Discovery Protocol (MSDP) and nonstop active routing (NSR) configured on the Protocol Independent Multicast (PIM) sparse-mode rendezvous point (RP), the rpd process might permanently get stuck when multicast traffic received shortly after Routing Engines switchover. PR1083385
When there are a number of secondary BGP routes in inet.0, an SNMP walk of inet.0 by the bgp4 MIB can cause a core if the corresponding primary routes are being deleted. PR1083988.
1. configure the ospf and ospf3 in all routers 2. configure node protection 3. check for 22.1.1.0 any backup is present 4. enable pplfa all 5. check for 22.1.1.0 any pplfa backup is present through r2 we are not seeing any pplfa backup for 22.1.1.0 PR1085029
When BGP route is leaked to a routing-instance and there is an import policy to overwrite the route preference, if damping is also configured in BGP, the BGP routes which were copied to second table cannot be deleted after routes were deleted in master table. This is a day-1 issue. PR1090760
When removing BGP Prefix-Independent Convergence (PIC) from the configuration, the expected behavior is that any protected path would become unprotected. But in this case, the multipath entry that contains the protection path (which is supposed to be removed) remains active, until BGP session flaps or the route itself flaps. As a workaround, we can use "commit full" command to correct or to commit. PR1092049
In BGP environment, when configuring RIB copy of routes from primary routing table to secondary routing table (for example, by using the CLI command "import-rib [ inet.0 XX.inet.0]") and if the second route-table's instance is type "forwarding", due to the BGP routes in secondary routing table may get deleted and not correctly re-created, the routes may be gone on every commit (even commit of unrelated changes). As a workaround, for re-creating the BGP routes in secondary route table, use CLI command "commit full" to make configuration changes. PR1093317
In Junos OS Release 9.1 and later, RFC 4893 introduces two new optional transitive BGP attributes, AS4_PATH and AS4_AGGREGATOR. These new attributes are used to propagate 4-byte AS path information across BGP speakers that do not support 4-byte AS numbers. In this case, when AS4_AGGREGATOR attribute (18) is received from a 2-byte AS peer (note AS4_AGGREGATOR attribute is only received when the aggregator has 4-byte AS but this peer only supports 2-byte AS), NSR synchronization with standby Routing Engine would fail, causing session constantly bouncing on standby Routing Engine (hogging CPU). PR1093615
The rpd process might crash when resolve-vpn and rib inet.3 are configured under separate levels (BGP global, group and peer). The fix is if anybody configures a family at a lower level, reset the state created by either of configuration statements from higher levels. This behavior conforms with our current behavior of family configuration - which is that any configuration at a lower level is honored and the higher level configuration is reset. PR1094499.
When BGP routes has multiple protocol nexthops including discard/reject and other IGP nexthops, the discard/reject nexthop will be selected as BGP nexthop, which will cause traffic loss. PR1096363
When polling SNMP OID isisPacketCounterTable 1.3.6.1.2.1.138.1.5.3, the rpd process might crash. PR1101080.
When the IS-IS configurations have been removed, the IS-IS LSDB contents get flushed. If at the same time of this deletion process, there is an SPF execution (that is, try to access the data structures at same time when/a fraction of seconds after freeing its content), routing protocol process (rpd) crash occurs. PR1103631
Services Applications
When an MX Series router configured as an LNS sends an Access-Request message to RADIUS for an LNS subscriber, the LNS now includes the Called-Station-ID-Attribute when it receives AVP 21 in the ICRQ message from the LAC. PR790035
When an MS-DPC PIC reboots due to a crash or manual intervention, it might get stuck in a booting loop if the MS-DPC up-time is more than 49 days and 17 hours. After 5 consecutive boot failures, the MS-DPC PIC will go offline automatically and give the following error message: [ 15:21:22.344 LOG: Err] ICHIP( 0): SPI4 Training failed while waiting for PLL to get locked, ichip_sr a_spi4_r x_snk_init_s tatus_clk [ 15:21:22.344 LOG: Err]CMSP C:I-Chip( 0)SPI4RxSinkinitstatusclockfailed,cmsdpc_spi4_init [15:21:22.344 LOG: Err] CMX: I(0) ASIC SPI4 init failed [ 15:21:22.3 79 LOG: Err] Node for service control ifl 68, is already present [ 15:21:23.20 7 LOG: Err] ASER0 SPI-4 XLR source core OOF did not go low in 20ms. [ 15:21:23.208 LOG: Err] ASER/XLR0 spi4 stop src train failed! [ 15:21:23.208 LOG:Err]ASER0 XLRSPI-4 sinkcoreDPAincompletein20ms. [15:21:23.208 LOG: Err] ASER/XLR0 spi4 sink core init failed! [ 15:21:2 4.465 LOG: Err] ICHIP( 0): SPI4 Stats Unexpected 2'b 11 Error, isra_spi4_p arse_panic_err ors [ 15:21:2 4.465 LOG: Err] ICHIP( 0): SPI4 Tx Lost Sync Error, isra_spi4_p arse_panic_err ors . In order to recover from this state, the whole MS-DPC needs to be rebooted. PR828649
In IPsec environment, after performing the Routing Engine switchover (for example, performing Graceful Routing Engine Switchover) or chassis reboot (that is, whole device is powered down and powered UP again), due to the key management daemon (kmd) may be launched before the Routing Engine mastership is finalized, it may stop running on the new master Routing Engine. PR863413
On MX Series platform, when using the MS-DPC with MPSDK to support Captive Portal Content Delivery (cpcd) service, the MAC might get stuck on the FPC due to processing the high rate of packets (for example, 5kpps HTTP traffic). In addition, reloading the affected FPC might only temporarily resolve the issue while it will appear again once scaling up.PR1037143
In CG-NAT or statefull firewall environment, due to a null pointer check bug, the MS-DPC might crash every few hours. Note that this is a regression issue. PR1079981
The crash happens if in a http flow, the flow structure is allocated at a particular memory region. There is no workaround but the chances of hitting this issue are very low PR1080749
On Layer 2 Tunnel Protocol (L2TP) network server (LNS), during L2TP session establishment, when receiving Incoming-Call-Connected (ICCN) messages with Last Sent LCP CONFREQ Attribute Value Pair (AVP) but without Initial Received LCP CONFREQ and Last Received LCP CONFREQ AVPs, the jl2tpd process might crash. PR1082673
On Trivial File Transfer Protocol (TFTP) Application Layer Gateway (ALG) with NAT translation type "dynamic-nat44" configured, MS-DPC/MS-MPC/MS-MIC might crash when processes the TFTP packets. PR1091179
On M Series platform, in Layer 2 Tunneling Protocol (L2TP) network server (LNS) environment, not all attributes (Missing NAS-Identifier, NAS-Port-Type, Service-Type, Framed-Protocol attributes) within Accounting-Request packet are sending to the RADIUS server. PR1095315
If MS-DPC is used in CG-NAT environment, in a very rare condition, when the MS-DPC tries to delete a NAT mapping entry (e.g. entry timeout), error might occur and the MS-DPC might get rebooted and then generate a core file. PR1095396
Some values of MIB object jnxSrcNatStatsEntry might be doubled when AMS (or rsp) interface and NAT are configured together. PR1095713
Software Installation and Upgrade
Add "on <host>" argument to to "request system software validate" to allow validation on a remote host/Routing Engine running Junos. PR1066150
Subscriber Access Management
In subscriber management environment, after deactivating a service with Change of Authorization (CoA) dynamic requests, if the Acct-Stop response is not received, the Broadband Network Gateway (BNG) will send CoA NAK message when the same service is activated again. The authd process crash will be observed and some sessions are stuck and cannot be terminated after terminating sessions. PR1004478
The authd process memory leaks slowly when subscribers login and logout, which eventually leads the process to crash and generate a core file. PR1035642
On MX Series routers, the generic authentication service process (authd) may fail to send Acct-off message to the RADIUS server. This is because management daemon (mgd) might not notify the authd prior to executing system reboot or system shutdown. Also, the authd might fail to generate the Acct-off message as well when it is terminated and there are no active subscribers. PR1053044
In subscriber management environment with Remote Authentication Dial In User Service (RADIUS) server configured, when performing scaling subscribers login/logout, the device may stuck in RADIUS communication. PR1070468
In subscriber management environment, when dual-stack service is activated by the Change of Authorization (CoA) request from the Radius Server, both families will be activated in the same profile response. Due to a software defect, the service accounting session id is not generated properly and the Service Accounting Messages and Interim-updates failed to be sent out. PR1071093
Subscriber is not coming up when CISCO AVPair VSA value is returned in Radius ACCESS-ACCEPT packets in certain scenarios. PR1074992
A CoA Request containing LI attributes cannot contain any non-LI service activations, de-activations or variable modifications. PR1079036
If authentication-order is configured as none under access profile and domain-name servers (DNS) are configured locally under access profile, then the subscriber will login but will not get DNS addresses which were configured locally. PR1079691
In scaled DHCP subscribers environment, the authd process might crash and generate a core file after clearing DHCP binding or logout subscribers. PR1094674
User Interface and Configuration
Due to a change in an existing PR, group names in the configuration must be a string of alphanumericals, dashes or underscores. There is not workaround other than following the group name instructions. PR1087051
VPNs
Problem, trigger and symptom: On dual Routing Engines, if mvpn protocol itself is not configured, and non stop routing is enabled, the show command "show task replication" on master Routing Engine will list MVPN protocol even though it is not configured. Other than the misleading show output which may be slightly confusing to the user/customer, there is no functional impact due to this issue as such. There is no workaround available. PR1078305
In PIM Draft-Rosen Multicast VPN (MVPN) environment, in a setup where active C-PR, standby C-RP, C-receivers, C-source are located in different VPN site of MVPN instance, once the link to active C-RP is flapped, PE which connects to C-receivers would send (*,g) join and (s,g,rpt) prune towards standby C-RP, when the PE which connects to standby C-RP receives the (*,g) join and (s,g, rpt) prune over mt-, it ends up updating the (s,g) forwarding entry with mt- as downstream, which is already the incoming interface (IIF). This creates a forwarding loop due to missing check if IIF is same as OIF when PIM make-before-break (MBB) join load-balancing feature is enabled and as a result traffic gets looped back into the network. Loop once formed will remain at least for 210 seconds till the delayed prune timer expires. After this, IIF is updated to the interface towards standby C-RP finally. PR1085777
In NG-MVPN spt-only mode with a PE router acts as the rendezvous point (RP), if there are only local receivers, the unnecessary multicast traffic continuously goes to this RP and dropped though it is not in the shortest-path tree (SPT) path from source to receiver. PR1087948
When there are more than 2000 outgoing interfaces (OIFs) for a same multicast group on MVPN egress PE, the multicast forwarding entries installed by MVPN might have duplicated OIFs and resulting in duplicated traffic. PR1095877
In Internet multicast over an MPLS network by using next-generation Layer 3 VPN multicast (NG-MVPN) environment, when rib-groups are configured to use inet.2 as RPF rib for Global Table Multicast (GTM, internet multicast) instance, the ingress PE may fail to add P-tunnel as downstream even after receiving BGP type-7 routes. In addition, this issue only affects GTM. PR1104676