Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ND: Lost of Global IPV6 on node after sending lot of UDP frame from BR #5790

Closed
biboc opened this issue Aug 31, 2016 · 46 comments
Closed

ND: Lost of Global IPV6 on node after sending lot of UDP frame from BR #5790

biboc opened this issue Aug 31, 2016 · 46 comments
Assignees
Labels
Area: network Area: Networking Type: bug The issue reports a bug / The PR fixes a bug (including spelling errors)

Comments

@biboc
Copy link
Member

biboc commented Aug 31, 2016

I'm using two SAMR21, one with BR (on A) and the other (on B) with
gnrc_networking example.
I'm on April release.
Switch on A (border router) then switch on B, I'm able to send UDP
message from Linux to 2001:db8 address of B.
If I try to send lot of data (every 100ms), it works for a while and
then B does not receive message anymore. After I see it failed, I
stopped the loop sending UDP message, I checked on BR and 2001:db8
address of B was not here anymore. I can ping B with fe80:: address on
iface 6 but can't with 2001:db8.

Why or how can a node be removed from ncache? Why the node does not
automatically reconnect with BR?

Answer from Martine:
Hi,
can you check if this is #5467?

Why or how can a node be removed from ncache? Why the node does not
automatically reconnect with BR?

As the implementer of the neighbor discovery I can say this: because
the neighbor discovery is sh** ;-). It was written in a haste and has
a lot of design flaws. I'm working on a replacement (see
#5704, happy to hear your
thoughts ;-)), but sadly I did not find some time to put heavy
implementation efforts into this yet (hopefully in autumn though, so
we have something for the October release, fingers crossed).

Thanks for reporting and kind regards,
Martine

@biboc biboc added Type: bug The issue reports a bug / The PR fixes a bug (including spelling errors) Area: network Area: Networking labels Aug 31, 2016
@kaspar030 kaspar030 added the GNRC label Sep 1, 2016
@biboc
Copy link
Member Author

biboc commented Oct 18, 2016

I tried again to send lot of data in a row (20 packets/sec)
And actually NDP sends also messages between these packets and at some point, the node shows:
ndp internal: set 2001:db8::fec2:3d00:4:abb1 state to STALE (go into DELAY on next packet)
Then I can't communicate to the node with global IPV6 but I can with local IPV6

@miri64
Copy link
Member

miri64 commented Oct 18, 2016

Directly after?

@miri64 miri64 added this to the Release 2016.10 milestone Oct 18, 2016
@biboc
Copy link
Member Author

biboc commented Oct 18, 2016

node starts and get global IPV6: great
I start sending data (~20 packets/sec), I receive 380 packets and it stops with last message:
ndp internal: set 2001:db8::fec2:3d00:4:abb1 state to STALE (go into DELAY on next packet)

In the middle, I've got NDP receiving SL2A:

(h:15 m:20 s:32) received line: 113
(h:15 m:20 s:32) Waiting for incoming packet...
(h:15 m:20 s:32) received line: 114
(h:15 m:20 s:32) Waiting for incoming packet...
(h:15 m:20 s:32) received line: 115
(h:15 m:20 s:32) Waiting for incoming packet...
(h:15 m:20 s:32) received line: 116
(h:15 m:20 s:32) Waiting for incoming packet...
(h:15 m:20 s:32) ndp: received SL2A (link-layer address: fc:c2:3d:00:00:04:ab:b1)
(h:15 m:20 s:32) ndp internal: send neighbor advertisement (iface: 6, tgt: 2001:db8::fec2:3d00:4:c0ba, dst: 2001:db8::fec2:3d00:4:abb1, supply_tl2a: 0)
(h:15 m:20 s:32) ndp internal: set 2001:db8::fec2:3d00:4:abb1 state to DELAY (probe with unicast NS in 5 seconds)
(h:15 m:20 s:33) received line: 121
...
(h:15 m:20 s:45) received line: 380
(h:15 m:20 s:45) Waiting for incoming packet...
(h:15 m:20 s:55) ndp internal: set 2001:db8::fec2:3d00:4:abb1 state to STALE (go into DELAY on next packet)

And you can see that node misses some lines when NDP is talking

@miri64
Copy link
Member

miri64 commented Oct 18, 2016

what are those "lines"?

@biboc
Copy link
Member Author

biboc commented Oct 18, 2016

Some UDP testing lines ("hello world')

@miri64
Copy link
Member

miri64 commented Oct 18, 2016

Ah okay, thanks. I will consider this for my tests when testing the new ND!

@biboc
Copy link
Member Author

biboc commented Oct 18, 2016

This mainly show that if there are many messages arriving at the same time on a node, it can loose its global IPV6 address

Do you know why or how packets from 116 to 121 are not received?

@biboc
Copy link
Member Author

biboc commented Oct 18, 2016

If I do the same test with local IPV6, it will loose some line when ND runs but it will continue to the end because local IPV6 is not lost (comparing to global IPV6)

@miri64
Copy link
Member

miri64 commented Oct 18, 2016

Do you know why or how packets from 116 to 121 are not received?

I have multiple suspicions.

  • Some kind of collision at L2
  • Next hop determination fails due to the packet being in DELAY state
  • some issue in the packet buffer
  • full packet queues of the IPv6 thread

Any thing sounds like something you observed?

@biboc
Copy link
Member Author

biboc commented Oct 18, 2016

Some kind of collision at L2
Next hop determination fails due to the packet being in DELAY state
some issue in the packet buffer
full packet queues of the IPv6 thread

I can test but I need to know where to debug?
I didn't observed more than one I said yet

@miri64
Copy link
Member

miri64 commented Oct 18, 2016

I can test but I need to know where to debug?

Try the following:

  • Some kind of collision at L2

Check with a sniffer

  • Next hop determination fails due to the packet being in DELAY state

Add DEBUG() statements before any return of gnrc_sixlowpan_nd_next_hop_l2addr in gnrc_sixlowpan_nd.c

  • some issue in the packet buffer

Call gnrc_pktbuf_stats() whenever this is occuring (if it helps: this should be easily implementable as a shell commands).

  • full packet queues of the IPv6 thread

Call msg_queue_print in _event_loop in gnrc_ipv6.c before and after the msg_receive() call.

@biboc
Copy link
Member Author

biboc commented Oct 18, 2016

With IPV6 debug, I added the number of lost lines since the start of sending packets:

(h:16 m:0 s:29) ipv6: Received (src = fe80::fec2:3d00:4:abb1, dst = 2001:db8::fec2:3d00:4:c0ba, next header = 17, length = 31)
(h:16 m:0 s:29) ipv6: forward nh = 17 to other threads
(h:16 m:0 s:29) ipv6: Send receive command for 0x20002cd8 to 5
(h:16 m:0 s:29) ipv6: waiting for incoming message.
(h:16 m:0 s:29) received line: 360
(h:16 m:0 s:29) Nb lost lines: 81
(h:16 m:0 s:29) Waiting for incoming packet...
(h:16 m:0 s:29) ipv6: GNRC_NETAPI_MSG_TYPE_RCV received
(h:16 m:0 s:29) ipv6: Received (src = fe80::fec2:3d00:4:abb1, dst = 2001:db8::fec2:3d00:4:c0ba, next header = 17, length = 31)
(h:16 m:0 s:29) ipv6: forward nh = 17 to other threads
(h:16 m:0 s:29) ipv6: Send receive command for 0x20002cd8 to 5
(h:16 m:0 s:29) ipv6: waiting for incoming message.
(h:16 m:0 s:29) received line: 361
(h:16 m:0 s:29) Nb lost lines: 81
(h:16 m:0 s:29) Waiting for incoming packet...
(h:16 m:0 s:29) ipv6: GNRC_NETAPI_MSG_TYPE_RCV received
(h:16 m:0 s:29) ipv6: Received (src = fe80::fec2:3d00:4:abb1, dst = 2001:db8::fec2:3d00:4:c0ba, next header = 17, length = 31)
(h:16 m:0 s:29) ipv6: forward nh = 17 to other threads
(h:16 m:0 s:29) ipv6: Send receive command for 0x20002cd8 to 5
(h:16 m:0 s:29) ipv6: waiting for incoming message.
(h:16 m:0 s:29) received line: 362
(h:16 m:0 s:29) Nb lost lines: 81
(h:16 m:0 s:29) Waiting for incoming packet...
 (h:16 m:0 s:37) ipv6: GNRC_NETAPI_MSG_TYPE_RCV received
(h:16 m:0 s:37) ipv6: Received (src = fe80::fec2:3d00:4:abb1, dst = ff15::abcd, next header = 17, length = 14)
(h:16 m:0 s:37) ipv6: packet destination not this host
(h:16 m:0 s:37) ipv6: decrement hop limit to 63
(h:16 m:0 s:37) ipv6: do not forward packets with link-local source or destination address
(h:16 m:0 s:37) ipv6: waiting for incoming message.
(h:16 m:0 s:39) ipv6: Neigbor cache state timeout received
(h:16 m:0 s:39) ndp internal: set 2001:db8::fec2:3d00:4:abb1 state to STALE (go into DELAY on next packet)
(h:16 m:0 s:39) ipv6: waiting for incoming message.
(h:16 m:1 s:37) ipv6: GNRC_NETAPI_MSG_TYPE_RCV received
(h:16 m:1 s:37) ipv6: Received (src = fe80::fec2:3d00:4:abb1, dst = ff15::abcd, next header = 17, length = 14)
(h:16 m:1 s:37) ipv6: packet destination not this host
(h:16 m:1 s:37) ipv6: decrement hop limit to 63
(h:16 m:1 s:37) ipv6: do not forward packets with link-local source or destination address
(h:16 m:1 s:37) ipv6: waiting for incoming message.

@biboc
Copy link
Member Author

biboc commented Oct 18, 2016

I added the followings:

Add DEBUG() statements before any return of gnrc_sixlowpan_nd_next_hop_l2addr in gnrc_sixlowpan_nd.c

Call msg_queue_print in _event_loop in gnrc_ipv6.c before and after the msg_receive() call.

Call gnrc_pktbuf_stats() whenever this is occuring (if it helps: this should be easily implementable as a shell commands).

gnrc_pktbuf_stats() is called every received packet

(h:16 m:19 s:21) received line: 363
(h:16 m:19 s:21) Nb lost lines: 63
(h:16 m:19 s:21) Waiting for incoming packet...
(h:16 m:19 s:21) Message queue of thread 4
(h:16 m:19 s:21)     size: 8 (avail: 0)
(h:16 m:19 s:21) received line: 364
(h:16 m:19 s:21) Nb lost lines: 63
(h:16 m:19 s:21) Waiting for incoming packet...
(h:16 m:19 s:21) ndp: received SL2A (link-layer address: fc:c2:3d:00:00:04:ab:b1)
(h:16 m:19 s:21) ndp internal: send neighbor advertisement (iface: 6, tgt: 2001:db8::fec2:3d00:4:c0ba, dst: 2001:db8::fec2:3d00:4:abb1, supply_tl2a: 0)
(h:16 m:19 s:21) Message queue of thread 4
(h:16 m:19 s:21)     size: 8 (avail: 1)
(h:16 m:19 s:21)     * 5: sender: 4, type: 0x0514, content: 536882104 (0x20002bb8)
(h:16 m:19 s:21) gnrc_sixlowpan_nd_next_hop_l2addr: gnrc_ipv6_nc_get_l2_addr(l2addr, l2addr_len, nc_entry)
(h:16 m:19 s:21) Message queue of thread 4
(h:16 m:19 s:21)     size: 8 (avail: 1)
(h:16 m:19 s:21)     * 6: sender: 3, type: 0x0513, content: 536881840 (0x20002ab0)
(h:16 m:19 s:21) Message queue of thread 4
(h:16 m:19 s:21)     size: 8 (avail: 0)
(h:16 m:19 s:21) received line: 365
(h:16 m:19 s:21) Nb lost lines: 63
(h:16 m:19 s:21) Waiting for incoming packet...
(h:16 m:19 s:21) Message queue of thread 4
(h:16 m:19 s:21)     size: 8 (avail: 0)
(h:16 m:19 s:21) received line: 367
(h:16 m:19 s:21) Nb lost lines: 64
(h:16 m:19 s:21) Waiting for incoming packet...
(h:16 m:19 s:22) Message queue of thread 4
(h:16 m:19 s:22)     size: 8 (avail: 0)
(h:16 m:19 s:22) received line: 368
(h:16 m:19 s:22) Nb lost lines: 64
(h:16 m:19 s:22) Waiting for incoming packet...
(h:16 m:19 s:22) Message queue of thread 4
(h:16 m:19 s:22)     size: 8 (avail: 0)
(h:16 m:19 s:22) received line: 369
(h:16 m:19 s:22) Nb lost lines: 64
(h:16 m:19 s:22) Waiting for incoming packet...
(h:16 m:19 s:22) Message queue of thread 4
(h:16 m:19 s:22)     size: 8 (avail: 0)
(h:16 m:19 s:22) received line: 370
(h:16 m:19 s:22) Nb lost lines: 64
(h:16 m:19 s:22) Waiting for incoming packet...
(h:16 m:19 s:22) Message queue of thread 4
(h:16 m:19 s:22)     size: 8 (avail: 0)
(h:16 m:19 s:22) received line: 371
(h:16 m:19 s:22) Nb lost lines: 64
(h:16 m:19 s:22) Waiting for incoming packet...
(h:16 m:19 s:22) Message queue of thread 4
(h:16 m:19 s:22)     size: 8 (avail: 0)
(h:16 m:19 s:22) received line: 372
(h:16 m:19 s:22) Nb lost lines: 64
(h:16 m:19 s:22) Waiting for incoming packet...
(h:16 m:19 s:22) Message queue of thread 4
(h:16 m:19 s:22)     size: 8 (avail: 0)
(h:16 m:19 s:22) received line: 373
(h:16 m:19 s:22) Nb lost lines: 64
(h:16 m:19 s:22) Waiting for incoming packet...
(h:16 m:19 s:22) Message queue of thread 4
(h:16 m:19 s:22)     size: 8 (avail: 0)
(h:16 m:19 s:22) received line: 374
(h:16 m:19 s:22) Nb lost lines: 64
(h:16 m:19 s:22) Waiting for incoming packet...
(h:16 m:19 s:22) Message queue of thread 4
(h:16 m:19 s:22)     size: 8 (avail: 0)
(h:16 m:19 s:22) received line: 375
(h:16 m:19 s:22) Nb lost lines: 64
(h:16 m:19 s:22) Waiting for incoming packet...
(h:16 m:19 s:22) Message queue of thread 4
(h:16 m:19 s:22)     size: 8 (avail: 0)
(h:16 m:19 s:22) received line: 376
(h:16 m:19 s:22) Nb lost lines: 64
(h:16 m:19 s:22) Waiting for incoming packet...
(h:16 m:19 s:22) Message queue of thread 4
(h:16 m:19 s:22)     size: 8 (avail: 0)
(h:16 m:19 s:22) received line: 377
(h:16 m:19 s:22) Nb lost lines: 64
(h:16 m:19 s:22) Waiting for incoming packet...
(h:16 m:19 s:22) Message queue of thread 4
(h:16 m:19 s:22)     size: 8 (avail: 0)
(h:16 m:19 s:22) received line: 378
(h:16 m:19 s:22) Nb lost lines: 64
(h:16 m:19 s:22) Waiting for incoming packet...
(h:16 m:19 s:22) Message queue of thread 4
(h:16 m:19 s:22)     size: 8 (avail: 0)
(h:16 m:19 s:22) received line: 379
(h:16 m:19 s:22) Nb lost lines: 64
(h:16 m:19 s:22) Waiting for incoming packet...
(h:16 m:19 s:22) Message queue of thread 4
(h:16 m:19 s:22)     size: 8 (avail: 0)
(h:16 m:19 s:22) received line: 380
(h:16 m:19 s:22) Nb lost lines: 64
(h:16 m:19 s:22) Waiting for incoming packet...
(h:16 m:19 s:22) Message queue of thread 4
(h:16 m:19 s:22)     size: 8 (avail: 0)
(h:16 m:19 s:22) received line: 381
(h:16 m:19 s:22) Nb lost lines: 64
(h:16 m:19 s:22) Waiting for incoming packet...
(h:16 m:19 s:32) ndp internal: set 2001:db8::fec2:3d00:4:abb1 state to STALE (go into DELAY on next packet)
(h:16 m:19 s:32) Message queue of thread 4
(h:16 m:19 s:32)     size: 8 (avail: 0)
(h:16 m:19 s:42) Message queue of thread 4
(h:16 m:19 s:42)     size: 8 (avail: 0)

@miri64
Copy link
Member

miri64 commented Oct 18, 2016

I'm a little bit confused if this is the sender or the receiver now? Also, come to think of it received line: xxx suggests to me that the lines don't get lost but that the message is just not printed...

(h:16 m:19 s:21) received line: 365
(h:16 m:19 s:21) Nb lost lines: 63
(h:16 m:19 s:21) Waiting for incoming packet...
(h:16 m:19 s:21) Message queue of thread 4
(h:16 m:19 s:21)     size: 8 (avail: 0)
(h:16 m:19 s:21) received line: 367
(h:16 m:19 s:21) Nb lost lines: 64

Seems like none of my suspicions are true (if this is the sender)

@biboc
Copy link
Member Author

biboc commented Oct 18, 2016

This is the receiver, I only gave receiver output.
Line are not received because you can see that lost lines var is incremented

@miri64
Copy link
Member

miri64 commented Oct 18, 2016

This is the receiver, I only gave receiver output.
Line are not received because you can see that lost lines var is incremented

I have 2 questions:

  • How do you no the line got lost?
  • How is the loss of the global IPv6 address at the receiver's neighbor cache related to packet loss?

I think I still don't understand your setup :(, but maybe the issue is a duplicate of #5122?

Add DEBUG() statements before any return of gnrc_sixlowpan_nd_next_hop_l2addr in gnrc_sixlowpan_nd.c

This only helps on the sender side.

@biboc
Copy link
Member Author

biboc commented Oct 18, 2016

Ok I've got a BR + one node.
BR sends UDP packets (20/sec) with payload:"[n° packet]:hello world"
I showed here the output of the node

How do you no the line got lost?

Because each sent line has a number in the payload :-)

How is the loss of the global IPv6 address at the receiver's neighbor cache related to packet loss

Good question, only thing I see is that when ND runs (SL2A solicitation), the receiver misses some lines. And if the packets are sent to global IPV6 of the node, it stops receiving at some point as shown earlier.

And no it is not related to #5122 IMO because I don't need to wait 15-20min to lose global IPV6, I only need to send lot of UDP packets

Is it clearer?

@miri64
Copy link
Member

miri64 commented Oct 19, 2016

Yes. Can you do the same debugging on the BR? Or isn't that a RIOT node? Another theory I have might be, that the missing MAC might let you loose packets (both UDP and NDP) and that the issue stems from that.

@biboc
Copy link
Member Author

biboc commented Oct 20, 2016

Ok I got debug for BR and node. BR tries to send 1000 lines (with payload= "[Num Line] hello". I kept only interesting part of debug.

node:

Waiting for incoming packet...
received line: 0
Nb lost lines: 0
Waiting for incoming packet...
received line: 1
Nb lost lines: 0
...
Waiting for incoming packet...
received line: 116
Nb lost lines: 0
Waiting for incoming packet...
ndp: received SL2A (link-layer address: fc:c2:3d:00:00:04:ab:b1)
ndp internal: send neighbor advertisement (iface: 6, tgt: 2001:db8::fec2:3d00:4:c0ba, dst: 2001:db8::fec2:3d00:4:abb1, supply_tl2a: 0)
ndp internal: set 2001:db8::fec2:3d00:4:abb1 state to DELAY (probe with unicast NS in 5 seconds)
received line: 119
Nb lost lines: 2
...
Waiting for incoming packet...
received line: 130
Nb lost lines: 2
Waiting for incoming packet...
ndp: received SL2A (link-layer address: fc:c2:3d:00:00:04:ab:b1)
ndp internal: send neighbor advertisement (iface: 6, tgt: 2001:db8::fec2:3d00:4:c0ba, dst: 2001:db8::fec2:3d00:4:abb1, supply_tl2a: 0)
received line: 135
Nb lost lines: 6
...
Waiting for incoming packet...
received line: 148
Nb lost lines: 6
Waiting for incoming packet...
ndp: received SL2A (link-layer address: fc:c2:3d:00:00:04:ab:b1)
ndp internal: send neighbor advertisement (iface: 6, tgt: 2001:db8::fec2:3d00:4:c0ba, dst: 2001:db8::fec2:3d00:4:abb1, supply_tl2a: 0)
received line: 151
Nb lost lines: 8
...
Waiting for incoming packet...
received line: 162
Nb lost lines: 8
Waiting for incoming packet...
received line: 163
Nb lost lines: 8
Waiting for incoming packet...
ndp internal: set 2001:db8::fec2:3d00:4:abb1 state to PROBE (probe with 3 unicast NS every 1000000 us)
ndp internal: send neighbor solicitation (iface: 6, src: @!,  tgt: 2001:db8::fec2:3d00:4:abb1, dst: 2001:db8::fec2:3d00:4:abb1)
6lo nd: address registration successful
ndp internal: set 2001:db8::fec2:3d00:4:abb1 state to REACHABLE (reachable time = 18000000 us)
received line: 214
Nb lost lines: 58
Waiting for incoming packet...
received line: 215
Nb lost lines: 58
...
Waiting for incoming packet...
received line: 329
Nb lost lines: 58
Waiting for incoming packet...
ndp: received SL2A (link-layer address: fc:c2:3d:00:00:04:ab:b1)
ndp internal: send neighbor advertisement (iface: 6, tgt: 2001:db8::fec2:3d00:4:c0ba, dst: 2001:db8::fec2:3d00:4:abb1, supply_tl2a: 0)
received line: 332
Nb lost lines: 60
...
Waiting for incoming packet...
received line: 347
Nb lost lines: 60
Waiting for incoming packet...
ndp: received SL2A (link-layer address: fc:c2:3d:00:00:04:ab:b1)
ndp internal: send neighbor advertisement (iface: 6, tgt: 2001:db8::fec2:3d00:4:c0ba, dst: 2001:db8::fec2:3d00:4:abb1, supply_tl2a: 0)
received line: 350
Nb lost lines: 62
...
Waiting for incoming packet...
received line: 362
Nb lost lines: 62
Waiting for incoming packet...
ndp: received SL2A (link-layer address: fc:c2:3d:00:00:04:ab:b1)
ndp internal: send neighbor advertisement (iface: 6, tgt: 2001:db8::fec2:3d00:4:c0ba, dst: 2001:db8::fec2:3d00:4:abb1, supply_tl2a: 0)
received line: 366
Nb lost lines: 65
...
Waiting for incoming packet...
received line: 376
Nb lost lines: 65
Waiting for incoming packet...
 ndp internal: set 2001:db8::fec2:3d00:4:abb1 state to STALE (go into DELAY on next packet)

BR:

line: 0
line: 1
line: 2
line: 3
...
line: 115
line: 116
ndp internal: set 2001:db8::fec2:3d00:4:c0ba state to PROBE (probe with 3 unicast NS every 1000000 us)
ndp internal: send neighbor solicitation (iface: 6, src: @!,  tgt: 2001:db8::fec2:3d00:4:c0ba, dst: 2001:db8::fec2:3d00:4:c0ba)
line: 117
...
line: 129
line: 130
ndp internal: send neighbor solicitation (iface: 6, src: @!,  tgt: 2001:db8::fec2:3d00:4:c0ba, dst: 2001:db8::fec2:3d00:4:c0ba)
line: 131
line: 132
...
line: 147
line: 148
ndp internal: send neighbor solicitation (iface: 6, src: @!,  tgt: 2001:db8::fec2:3d00:4:c0ba, dst: 2001:db8::fec2:3d00:4:c0ba)
line: 149
line: 150
...
line: 163
line: 164
ndp internal: set fe80::1 state to DELAY (probe with unicast NS in 5 seconds)
line: 165
...
line: 212
line: 213
ndp: received SL2A (link-layer address: fc:c2:3d:00:00:04:c0:ba)
ndp internal: send neighbor advertisement (iface: 6, tgt: 2001:db8::fec2:3d00:4:abb1, dst: 2001:db8::fec2:3d00:4:c0ba, supply_tl2a: 0)
ndp internal: set 2001:db8::fec2:3d00:4:c0ba state to DELAY (probe with unicast NS in 5 seconds)
line: 214
line: 215
...
line: 279
line: 280
ndp internal: set fe80::1 state to PROBE (probe with 3 unicast NS every 1000000 us)
ndp internal: send neighbor solicitation (iface: 7, src: @!,  tgt: fe80::1, dst: fe80::1)
ndp internal: set fe80::1 state to REACHABLE (reachable time = 24000000 us)
line: 281
line: 282
...
line: 328
line: 329
ndp internal: set 2001:db8::fec2:3d00:4:c0ba state to PROBE (probe with 3 unicast NS every 1000000 us)
ndp internal: send neighbor solicitation (iface: 6, src: @!,  tgt: 2001:db8::fec2:3d00:4:c0ba, dst: 2001:db8::fec2:3d00:4:c0ba)
line: 330
line: 331
...
line: 346
line: 347
ndp internal: send neighbor solicitation (iface: 6, src: @!,  tgt: 2001:db8::fec2:3d00:4:c0ba, dst: 2001:db8::fec2:3d00:4:c0ba)
line: 348
line: 349
...
line: 361
line: 362
ndp internal: send neighbor solicitation (iface: 6, src: @!,  tgt: 2001:db8::fec2:3d00:4:c0ba, dst: 2001:db8::fec2:3d00:4:c0ba)
line: 363
line: 364
...
line: 372
line: 373
ndp: received SL2A (link-layer address: 3a:c6:cd:6c:0c:90)
ndp internal: send neighbor advertisement (iface: 7, tgt: fe80::2, dst: fe80::1, supply_tl2a: 0)
line: 374
line: 375
line: 376
...

@biboc
Copy link
Member Author

biboc commented Oct 20, 2016

missing MAC might let you loose packets

What do you mean? When does the node miss the MAC?

After three solicitations BR removes node global IPV6 from ncache?
Same for the node, after three advertisements without answer from BR, it removes its global IPV6?

@kaspar030
Copy link
Contributor

@biboc Do you have the same issue with debugging disabled?

On the samr21, output over serial is very expensive.

@miri64
Copy link
Member

miri64 commented Oct 20, 2016

missing MAC might let you loose packets

What do you mean? When does the node miss the MAC?

There is no MAC protocol implemented in RIOT yet. That's what I meant with missing MAC, but maybe the answer lies in @kaspar030's remark ;-).

@biboc
Copy link
Member Author

biboc commented Oct 20, 2016

@biboc Do you have the same issue with debugging disabled?

Yes same problem if ND debug is disabled
I'll try to disable more debug like waiting for incoming packet

@biboc
Copy link
Member Author

biboc commented Oct 20, 2016

@miri64 Does anything seems weird on the debug log I gave?

@biboc
Copy link
Member Author

biboc commented Oct 20, 2016

@kaspar030, see, I removed most of the debug (I show only if line%100 == 0)

100
ndp: received SL2A (link-layer address: fc:c2:3d:00:00:04:ab:b1)
ndp internal: send neighbor advertisement (iface: 6, tgt: 2001:db8::fec2:3d00:4:c0ba, dst: 2001:db8::fec2:3d00:4:abb1, supply_tl2a: 0)
ndp internal: set 2001:db8::fec2:3d00:4:abb1 state to DELAY (probe with unicast NS in 5 seconds)
ndp: received SL2A (link-layer address: fc:c2:3d:00:00:04:ab:b1)
ndp internal: send neighbor advertisement (iface: 6, tgt: 2001:db8::fec2:3d00:4:c0ba, dst: 2001:db8::fec2:3d00:4:abb1, supply_tl2a: 0)
ndp: received SL2A (link-layer address: fc:c2:3d:00:00:04:ab:b1)
ndp internal: send neighbor advertisement (iface: 6, tgt: 2001:db8::fec2:3d00:4:c0ba, dst: 2001:db8::fec2:3d00:4:abb1, supply_tl2a: 0)
ndp internal: set 2001:db8::fec2:3d00:4:abb1 state to PROBE (probe with 3 unicast NS every 1000000 us)
ndp internal: send neighbor solicitation (iface: 6, src: @!,  tgt: 2001:db8::fec2:3d00:4:abb1, dst: 2001:db8::fec2:3d00:4:abb1)
6lo nd: address registration successful
ndp internal: set 2001:db8::fec2:3d00:4:abb1 state to REACHABLE (reachable time = 18000000 us)
300
ndp: received SL2A (link-layer address: fc:c2:3d:00:00:04:ab:b1)
ndp internal: send neighbor advertisement (iface: 6, tgt: 2001:db8::fec2:3d00:4:c0ba, dst: 2001:db8::fec2:3d00:4:abb1, supply_tl2a: 0)
ndp: received SL2A (link-layer address: fc:c2:3d:00:00:04:ab:b1)
ndp internal: send neighbor advertisement (iface: 6, tgt: 2001:db8::fec2:3d00:4:c0ba, dst: 2001:db8::fec2:3d00:4:abb1, supply_tl2a: 0)
ndp: received SL2A (link-layer address: fc:c2:3d:00:00:04:ab:b1)
ndp internal: send neighbor advertisement (iface: 6, tgt: 2001:db8::fec2:3d00:4:c0ba, dst: 2001:db8::fec2:3d00:4:abb1, supply_tl2a: 0)
ndp internal: set 2001:db8::fec2:3d00:4:abb1 state to STALE (go into DELAY on next packet)

I don't receive anything afterwards

@miri64 miri64 removed this from the Release 2016.10 milestone Nov 11, 2016
@miri64
Copy link
Member

miri64 commented Jan 6, 2017

@biboc can you check if #5122 (comment) fixes this? This would at least be a quick fix for the release.

neiljay pushed a commit to neiljay/RIOT that referenced this issue Jan 16, 2017
RIOT-2016.10 - Release Notes
============================
RIOT is a real-time multi-threading operating system that supports a range of
devices that are typically found in the Internet of Things: 8-bit
microcontrollers, 16-bit microcontrollers and light-weight 32-bit processors.

RIOT is based on the following design principles: energy-efficiency, real-time
capabilities, small memory footprint, modularity, and uniform API access,
independent of the underlying hardware (this API offers partial POSIX
compliance).

RIOT is developed by an international open source community which is
independent of specific vendors (e.g. similarly to the Linux community) and is
licensed with a non-viral copyleft license (LGPLv2.1), which allows indirect
business models around the free open-source software platform provided by
RIOT.

About this release:
===================
This release provides a lot of new features as well as it  fixes several major
bugs. Among these new features are the new simplified network socket API
called sock, the GNRC specific CoAP implementation gcoap and several new
packages: TinyDTLS, the Aversive++ microcontroller library for robotics, the
u8g2 graphic library, and nanocoap.
Using the new sock API an implementation of the Simple Time Network Protocol
(SNTP) was also introduced, allowing for time synchronization between nodes.
New platforms include the Arduino Uno, the Arduino Duemilanove, the Arduino
Zero, SODAQ Autonomo, and the Zolertia remote (rev. B).
The most significant bug fix was done in native which led to a significantly
more robust handling of ISRs and now allows for at least 1,000 native
instances running stably on one machine.

About 263 pull requests with about 398 commits have been merged since the last
release and about 42 issues have been solved. 37 people contributed with code
in 100 days. 1006 files have been touched with 166500 insertions and 26926
deletions.

Notations used below:
=====================
+ means new feature/item
* means modified feature/item
- means removed feature/item

New features and changes
========================
General
-------
* Verbose behavior for assert() macro

Core
----
+ MPU support for Cortex-M

API changes
-----------
+ Socket-like sock API (replacing conn)
* netdev2: Add Testmodes and CCA modes
* IEEE 802.15.4: clean-up Intra-PAN behavior
* IEEE 802.15.4: centralize default values
* gnrc_pktbuf: allow for 0-sized snips
+ gnrc_netapi: mbox and arbitrary callback support

System libraries
----------------
No new features or changes

Networking
----------
+ Provide sock-port for GNRC
+ gcoap: a GNRC-based CoAP implementation
+ Simple Network Time Protocol (RFC 5905, section 14)
+ Priority Queue for packet snips
+ IPv4 header definitions

Packages
--------
+ nanocoap: CoAP header parser/builder
+ TinyDTLS: DTLS library
+ tiny-asn1: asn.1/der decoder
+ Aversive++ microcontroller programming library
+ u8g2 graphic library

Platforms
---------
+ Support for stm32f2xx MCU family
+ Low power modes for samd21 CPUs
+ More Arduino-based platforms:
    + Arduino Uno
    + Arduino Duemilanove
    + Arduino Zero
+ More boards of ST's Nucleo platforms:
    + ST Nucleo F030 board support
    + ST Nucleo F070 board support
    + ST Nucleo F446 board support
+ SODAQ Automono
+ Zolertia remote rev. B

Drivers
-------
+ W5100 Ethernet device
+ Atmel IO1 Xplained extension
+ LPD8808 LED strips
* at86rf2xx: provide capability to access the RND_VALUE random value register

Build System
------------
+ static-tests build target for easy local execution of CI's static tests

Other
-----
+ Provide Arduino API to Nucleo boards
+ Packer configuration file to build vagrant boxes
+ CC2650STK Debugger Support
+ ethos: add Ethos over TCP support

Fixed Issues from the last release
==================================
 RIOT-OS#534:  native debugging on osx fails
 RIOT-OS#2071: native: *long* overdue fixes
 RIOT-OS#3341: netdev2_tap crashes when hammered
 RIOT-OS#5007: gnrc icmpv6: Ping reply goes out the wrong interface
 RIOT-OS#5432: native: valgrind fails

Known Issues
============
Networking related issues
-------------------------
 RIOT-OS#3075: nhdp: unnecessary microsecond precision: NHDP works with timer values
       of microsecond precision which is not required. Changing to lower
       precision would save some memory.
 RIOT-OS#4048: potential racey memory leak: According to the packet buffer stats,
       flood-pinging a multicast destination may lead to a memory leak due to
       a race condition. However, it seems to be a rare case and a completely
       filled up packet buffer was not observed.
 RIOT-OS#4388: POSIX sockets: open socket is bound to a specific thread: This was an
       inherit problem of the conn API under GNRC. Since the POSIX sockets are
       still based on conn for this release, this issue persists
 RIOT-OS#4527: gnrc_ipv6: Multicast is not forwarded if routing node listens to the
       address (might still be fixable for release, see RIOT-OS#5729, RIOT-OS#5230: gnrc
       ipv6: multicast packets are not dispatched to the upper layers)
 RIOT-OS#5016: gnrc_rpl: Rejoining RPL instance as root after reboot messes up routing
 RIOT-OS#5055: cpuid: multiple radios will get same EUI-64 Nodes with multiple
       interfaces might get the same EUI-64 for them since they are generated
       from the same CPU ID.
 RIOT-OS#5656: Possible Weakness with locking in the GNRC network stack: For some
       operations mutexes to the network interfaces need to get unlocked in
       the current implementation to not get deadlocked. Recursive mutexes as
       provided in RIOT-OS#5731 might help to solve this problem.
 RIOT-OS#5748: gnrc: nodes crashing with too small packet buffer: A packet buffer of
       size ~512 B might lead to crashes. The issue describes this for several
       hundret nodes, but agressive flooding with just two nodes was also
       shown to lead to this problem.
 RIOT-OS#5858: gnrc: 6lo: potential problem with reassembly of fragments: If one frame
       gets lost the reassembly state machine might get out of sync

 ### NDP is not working properly
 RIOT-OS#4499: handle of l2src_len in gnrc_ndp_rtr_sol_handle: Reception of a router
       solicitation might lead to invalid zero-length link-layer addresses in
       neighbor cache.
 RIOT-OS#5005: ndp: router advertisement sent with global address: Under some
       circumstances a router might send RAs with GUAs. While they are ignored
       on receive (as RFC 4861 specifies), RAs should have link-local
       addresses and not even be send out this way.
 RIOT-OS#5122: NDP: global unicast address on non-6LBR nodes disappears after a while:
       Several issues (also see RIOT-OS#5760) lead to a global unicast address
       effectively being banned from the network (disappears from neighbor
       cache, is not added again)
 RIOT-OS#5467: ipv6 address vanishes when ARO (wrongly) indicates DUP caused by
       outdated ncache at router
 RIOT-OS#5539: Border Router: packet not forwarded from ethos to interface 6
 RIOT-OS#5790: ND: Lost of Global IPV6 on node after sending lot of UDP frame from BR

Timer related issues
--------------------
 RIOT-OS#4841: xtimer: timer already in the list: Under some conditions an xtimer can
       end up twice in the internal list of the xtimer module
 RIOT-OS#4902: xtimer: xtimer_set: xtimer_set does not handle integer overflows well
 RIOT-OS#5338: xtimer: xtimer_now() not ISR safe for non-32-bit platforms.
 RIOT-OS#5928: xtimer: usage in board_init() crashes: some boards use the xtimer in
       there board_init() function. The xtimer is however first initialized in
       the auto_init module which is executed after board_init()
 RIOT-OS#6052: tests: xtimer_drift gets stuck: xtimer_drift application freezes after
       ~30-200 seconds

native related issues
---------------------
 RIOT-OS#495:  native not float safe: When the FPU is used when an asynchronous context
       switch occurs, either the stack gets corrupted or a floating point
       exception occurs.
 RIOT-OS#2175: ubjson: valgind registers "Invalid write of size 4" in unittests
 RIOT-OS#4590: pkg: building relic with clang fails.
 RIOT-OS#5796: native: tlsf: early malloc will lead to a crash: TLSF needs pools to be
       initialized (which is currently expected to be done in an application).
       If a malloc is needed before an application's main started (e.g. driver
       initialization) the node can crash, since no pool is allocated yet.

other platform related issues
-----------------------------
 RIOT-OS#1891: newlib-nano: Printf formatting does not work properly for some numberic
       types: PRI[uxdi]64, PRI[uxdi]8 and float are not parsed in newlib-nano
 RIOT-OS#2006: cpu/nrf51822: timer callback may be fired too early
 RIOT-OS#2143: unittests: tests-core doesn't compile for all platforms: GCC build-ins
       were used in the unittests which are not available with msp430-gcc
 RIOT-OS#2300: qemu unittest fails because of a page fault
 RIOT-OS#4512: pkg: tests: RELIC unittests fail on iotlab-m3
 RIOT-OS#4522: avsextrem: linker sometimes doesn't find `bl_init_clks()`
 RIOT-OS#4560: make: clang is more pedantic than gcc oonf_api is not building with
       clang. (Partly solved by RIOT-OS#4593)
 RIOT-OS#4694: drivers/lm75a: does not build
 RIOT-OS#4737: cortex-m: Hard fault after a thread exits (under some circumstances)
 RIOT-OS#4822: kw2xrf: packet loss when packets get fragmented
 RIOT-OS#4876: at86rf2xx: Simultaneous use of different transceiver types is not
       supported
 RIOT-OS#4954: chronos: compiling with -O0 breaks
 RIOT-OS#4866: not all GPIO driver implementations are thread safe: Due to non-atomic
       operations in the drivers some pin configurations might get lost.
 RIOT-OS#5009: RIOT is saw-toothing in energy consumption (even when idling)
 RIOT-OS#5103: xtimer: weird behavior of tests/xtimer_drift: xtimer_drift randomly
       jumps a few seconds on nrf52
 RIOT-OS#5361: cpu/cc26x0: timer broken
 RIOT-OS#5405: Eratic timings on iotlab-m3 with compression context activated
 RIOT-OS#5460: cpu/samd21: i2c timing with compiler optimization
 RIOT-OS#5486: at86rf2xx: lost interrupts
 RIOT-OS#5489: cpu/lpc11u34: ADC broken
 RIOT-OS#5603: atmega boards second UART issue
 RIOT-OS#5678: at86rf2xx: failed assertion in _isr
 RIOT-OS#5719: cc2538: rf driver doesn't handle large packets
 RIOT-OS#5799: kw2x: 15.4 duplicate transmits
 RIOT-OS#5944: msp430: ipv6_hdr unittests fail
 RIOT-OS#5848: arduino: Race condition in sys/arduino/Makefile.include
 RIOT-OS#5954: nRF52 uart_write get stuck
 RIOT-OS#6018: nRF52 gnrc 6lowpan ble memory leak

other issues
------------
 RIOT-OS#1263: TLSF implementation contains (a) read-before-write error(s).
 RIOT-OS#3256: make: Setting constants on compile time doesn't really set them
       everywhere
 RIOT-OS#3366: periph/i2c: handle NACK
 RIOT-OS#4488: Making the newlib thread-safe: When calling puts/printf after
       thread_create(), the CPU hangs for DMA enabled uart drivers.
 RIOT-OS#4866: periph: GPIO drivers are not thread safe
 RIOT-OS#5128: make: buildtest breaks when exporting FEATURES_PROVIDED var
 RIOT-OS#5207: make: buildest fails with board dependent application Makefiles
 RIOT-OS#5390: pkg: OpenWSN does not compile: This package still uses deprecated
       modules and was not tested for a long time.
 RIOT-OS#5520: tests/periph_uart not working
 RIOT-OS#5561: C++11 extensions in header files
 RIOT-OS#5776: make: Predefining CFLAGS are parsed weirdly
 RIOT-OS#5863: OSX +  SAMR21-xpro: shell cannot handle command inputs larger than 64
       chars
 RIOT-OS#5962: Makefile: UNDEF variable is not working as documented
 RIOT-OS#6022: pkg: build order issue

Special Thanks
==============
We like to give our special thanks to all the companies that provided us with
their hardware for porting and testing, namely the people from (in
alphabeticalorder): Atmel, Freescale, Imagination Technologies, Limifrog,
Nordic, OpenMote, Phytec, SiLabs, UDOO,and Zolertia; and also companies that
directly sponsored development time: Cisco Systems, Eistec, Ell-i, Enigeering
Spirit, Nordic, FreshTemp LLC, OTAkeys and Phytec.

More information
================
http://www.riot-os.org

Mailing lists
-------------
* RIOT OS kernel developers list
  devel@riot-os.org (http://lists.riot-os.org/mailman/listinfo/devel)
* RIOT OS users list
  users@riot-os.org (http://lists.riot-os.org/mailman/listinfo/users)
* RIOT commits
  commits@riot-os.org (http://lists.riot-os.org/mailman/listinfo/commits)
* Github notifications
  notifications@riot-os.org (http://lists.riot-os.org/mailman/listinfo/notifications)

IRC
---
* Join the RIOT IRC channel at: irc.freenode.net, #riot-os

License
=======
* Most of the code developed by the RIOT community is licensed under the GNU
  Lesser General Public License (LGPL) version 2.1 as published by the Free
  Software Foundation.
* Some external sources are published under a separate, LGPL compatible
  license (e.g. some files developed by SICS).

All code files contain licensing information.
@PeterKietzmann PeterKietzmann modified the milestones: Release 2017.01, Release 2017.04 Jan 26, 2017
@kYc0o
Copy link
Contributor

kYc0o commented Apr 26, 2017

@biboc ping.

@biboc
Copy link
Member Author

biboc commented Apr 26, 2017

I'm sorry @kYc0o , I won't be able to test this for now.

@miri64
Copy link
Member

miri64 commented Nov 6, 2017

Fixed by #7925

@biboc
Copy link
Member Author

biboc commented Nov 7, 2017

Great, I believe you!

@miri64
Copy link
Member

miri64 commented Nov 27, 2017

After pondering this a little bit, I I'm not sure if this is actually a conjestion issue (which can't be fixed with normal IPv6). Can you retry?

@miri64
Copy link
Member

miri64 commented Nov 28, 2017

Testing now with

while true; do
    sleep 0.1
    echo "Hello World!" | nc -u 2001:db8::7b62:3323:ec77:c86 1337 -w0
done

From Linux host.

@miri64
Copy link
Member

miri64 commented Nov 28, 2017

(I'll run it a bit longer, but sniffer already tells me, that the ND messages come through clear as day, both for link-local addresses and GUAs)

@biboc
Copy link
Member Author

biboc commented Nov 28, 2017

Great if you solve it!
Congrats for your job on network @miri64

@miri64
Copy link
Member

miri64 commented Nov 28, 2017

Ran for over an our now and multiple updates to the neighbor cache and default router list happened on both border and downstream router. I think we can deem this issue fixed.

@miri64 miri64 closed this as completed Nov 28, 2017
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Area: network Area: Networking Type: bug The issue reports a bug / The PR fixes a bug (including spelling errors)
Projects
None yet
Development

No branches or pull requests

6 participants