- 
                Notifications
    You must be signed in to change notification settings 
- Fork 305
          Use Into<Message> in signing api
          #755
        
          New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
87aee8c    to
    50df818      
    Compare
  
    | Any tips for debugging WASM and ASAN? | 
| WASM fail is unrelated. I don't know what is causing the ASAN fail, from looking at the changeset I'd hazard a guess it is also unrelated. | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ACK 50df818
| In 50df818: I am seeing a test failure with  This is the first time I'm trying to test with  But code-reveiw ack. | 
| Wow, good catch @apoelstra. Managed to fix it with this change: c0937e3 Not sure what was wrong with the previous code, FFI wasn't happy reusing the same msg pointer.. Curious if you have any ideas? Seems like  edit: this also fixed ASAN. | 
| 
 I think we just got lucky and what we really need is Miri testing -- this fix does not look like a normal release/debug issue but rather a symptom of UB in our code. Sadly we will probably never be able to Miri-test this crate because Miri can't handle FFI calls. Maybe we can try running tests in valgrind. A long time ago that didn't work because of bugs in rustc but maybe those are resolved. If none of that works I suppose we can add tests with  BTW can you sqash the two commits together? I would prefer not to have any commits in history with UB, to the extent possible :). | 
c0937e3    to
    de3b93d      
    Compare
  
    | let mut entropy_p: *const ffi::types::c_void = ptr::null(); | ||
| let mut counter: u32 = 0; | ||
| let mut extra_entropy = [0u8; 32]; | ||
| let msg = msg.into(); | 
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't get why we have a local var here but on line 262 we just chain the calls? Did you mean to change both?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We need the local variable to ensure that the object lives through the entire time that we are using the pointer.
If we're still chaining the calls on line 262 then that also needs to be changed. Thanks for beating me to the review!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks, I thought as much.
Out of interest how can the value not exist the entire time we are using it when its pass by value, so is part of the stack frame, calling Into is just borrow checker stuff and as_c_ptr is type checker stuff, neither of which effects the actual data (I think). Then the value is used in a single function call (the ffi function call).
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm not sure what you mean by Into being "just borrow checker stuff". Into::into takes an abstract message by value and returns a Message by value. Then as_c_ptr borrows this object and returns a raw pointer whose lifetime must not exceed the lifetime of the Message. The borrow checker is barely involved with any of this, and even if it was, it can only do sanity checks; it never affects the semantics of code.
But if we don't give the Message a variable binding, its lifetime will only consist of one line of code. If we give it a binding it'll live until the end of the function.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@tcharding note that this is the same underlying issue as this lint: https://doc.rust-lang.org/rustc/lints/listing/warn-by-default.html#temporary-cstring-as-ptr
(one day we can write an attribute to add these lints for user types)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is just musings and for my education, so please only respond if it amuses you to do so.
When sign_ecdsa_with_noncedata_pointer is compiled is the parametr msg: impl Into<Message>, just 32 bytes in the stack frame?
I'm not sure what you mean by Into being "just borrow checker stuff".
I misspoke, FTR I don't know the exact correct terminology for all the parses of the compiler.
I read the link above but that is different in that a str has to have memory backing it but in our case I thought the memory backing it would be the 32 bytes in the stack frame that was passed in as msg (in function sign_ecdsa_with_noncedata_pointer) - so I can't understand why creating a local variable is fixing the problem.
Said another way, I get that at the end of msg.into().as_c_ptr() that there are no guarantees that the pointer is valid, but I don't get why we cannot tell it is valid because we know where the value is on the stack already because it was passed in with the function.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
(I think I can map C functions to opcodes but I do not know exactly how to map Rust functions to opcodes.)
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't get why we have a local var here but on line 262 we just chain the calls? Did you mean to change both?
@tcharding it's because here it's used in a loop. impl Into<Message> doesn't implement Copy, so if we .into it inside the loop we lose ownership and cannot proceed with more iterations.
 
By calling .into outside of the loop, we get the Message type which implements Copy so it can be used inside the loop as many times as needed.
We need the local variable to ensure that the object lives through the entire time that we are using the pointer.
If we're still chaining the calls on line 262 then that also needs to be changed. Thanks for beating me to the review!
@apoelstra interesting. So once Rust passes .into().as_c_ptr() to the ffi, it marks the memory as unused and so that memory could be recycled before the C code finishes doing what it needs with that memory slice? I wonder if that is what was causing the UB in the release build.
I've updated all instances of chaining to have the .into() in a higher scope so the memory is held onto.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I read the link above but that is different in that a
strhas to have memory backing it but in our case I thought the memory backing it would be the 32 bytes in the stack frame
How could it be? The 32 bytes in the stack frame belong to an object of a completely different type (an opaque Into<Message> vs a Message). And we are telling the compiler that we don't need the Into<Message> anymore and it can reclaim the memory.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I need to read a book on how the Rust compiler works. Thanks for your patience.
fix: release test
de3b93d    to
    ec0a69f      
    Compare
  
    There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ACK ec0a69f
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
ACK ec0a69f; successfully ran local tests; I wonder if we should do the same thing with SecretKey
… api ec0a69f41931d598cdafe0e9d709f9530b7771c5 Use Into<Message> in signing api (Liam Aharon) Pull request description: Closes #700 (also see rust-bitcoin/rust-bitcoin#2821). Unrelated question while I have the authors attention: the schnorr apis (e.g. https://github.com/liamaharon/rust-secp256k1/blob/9afbf5111113ce84ff6f3b52f37c60554af2c283/secp256k1-sys/src/lib.rs#L81-L82) accepts a param named `msg32`, then directly after `msg_len`. I find it confusing since I would assume from the name `msg32` it must be 32 bytes. Should those instances be renamed `msg` if it is indeed variable len? ACKs for top commit: tcharding: ACK ec0a69f41931d598cdafe0e9d709f9530b7771c5 apoelstra: ACK ec0a69f41931d598cdafe0e9d709f9530b7771c5; successfully ran local tests; I wonder if we should do the same thing with SecretKey Tree-SHA512: 2c3d9987d41f15ac91b0a77595b4d238cd52d9484a461f7541b2fd2e03b86f908ae7b34c76c1239501eb67246ba336161be1c6ae41d7f84122e9c003a3ade7d8
… api ec0a69f41931d598cdafe0e9d709f9530b7771c5 Use Into<Message> in signing api (Liam Aharon) Pull request description: Closes #700 (also see rust-bitcoin/rust-bitcoin#2821). Unrelated question while I have the authors attention: the schnorr apis (e.g. https://github.com/liamaharon/rust-secp256k1/blob/9afbf5111113ce84ff6f3b52f37c60554af2c283/secp256k1-sys/src/lib.rs#L81-L82) accepts a param named `msg32`, then directly after `msg_len`. I find it confusing since I would assume from the name `msg32` it must be 32 bytes. Should those instances be renamed `msg` if it is indeed variable len? ACKs for top commit: tcharding: ACK ec0a69f41931d598cdafe0e9d709f9530b7771c5 apoelstra: ACK ec0a69f41931d598cdafe0e9d709f9530b7771c5; successfully ran local tests; I wonder if we should do the same thing with SecretKey Tree-SHA512: 2c3d9987d41f15ac91b0a77595b4d238cd52d9484a461f7541b2fd2e03b86f908ae7b34c76c1239501eb67246ba336161be1c6ae41d7f84122e9c003a3ade7d8
Closes #700 (also see rust-bitcoin/rust-bitcoin#2821).
Unrelated question while I have the authors attention: the schnorr apis (e.g. https://github.com/liamaharon/rust-secp256k1/blob/9afbf5111113ce84ff6f3b52f37c60554af2c283/secp256k1-sys/src/lib.rs#L81-L82) accepts a param named
msg32, then directly aftermsg_len. I find it confusing since I would assume from the namemsg32it must be 32 bytes. Should those instances be renamedmsgif it is indeed variable len?