[Proposal] Allow to create arrays without initializing them to 0 #1037

msedi · 2017-10-23T22:05:42Z

msedi
Oct 23, 2017

I must admit I haven't had the possibility to test it, since there is no way out of C# nor MSIL to create an array without initializing it to 0.

We do a lot of number crunching and during processing data, we need need to allocate a lot of data. Mostly images of 512x512 that are linearized to a 1D-array.

What happens when I allocate an array in C# is that it's always initialized to 0 although I know, that immediately after creating the array, I will set the contents to a defined value, other then 0.

In a regular scenario this might be practical but in our envrionment I assume zeroing the contents of the array is not negligible in terms of perfomance when it comes to a lot of data.

I would be very interested about the performance (leaving out zeroing the array). And if possible - and here comes the proposal -

extend creating an array creation to leave out zeroing

I understand this might be a little hard in C#, but it would be nice to have an extension in MSIL for newarr, either as something newarr.nz or as an optional argument.

Answered by msedi

Oct 19, 2020

It seems that the problem is now solved with .NET5 and GC.AllocateUnitializedArray<T>.

View full answer

Joe4evr · 2017-10-23T22:15:46Z

Joe4evr
Oct 23, 2017

Dupe of/related to #868.

0 replies

msedi · 2017-10-23T22:23:25Z

msedi
Oct 23, 2017
Author

@Joe4evr: Yes, related to #868 but in #868 it's only for stackalloc (MSIL: localloc), I somehow need it for MSIL newarr. Do you think I should close this one a refer to #868?

0 replies

Joe4evr · 2017-10-23T22:27:12Z

Joe4evr
Oct 23, 2017

While that issue is raised about stackalloc initially, the discussion itself is largely about why the runtime zeroes out memory in the first place, which is what you'll be up against regardless.

0 replies

scalablecory · 2017-10-23T23:02:43Z

scalablecory
Oct 23, 2017

@msedi consider using the System.Buffers package for this.

0 replies

svick · 2017-10-23T23:30:24Z

svick
Oct 23, 2017
Collaborator

@scalablecory Do you mean the System.Buffers namespace in the pre-release version of the System.Memory package (which includes types related to Memory<T>)?

0 replies

svick · 2017-10-23T23:55:33Z

svick
Oct 23, 2017
Collaborator

My questions:

Would avoiding zero-initialization actually significantly improve performance for you? Consider that when allocating arrays in C#, the most significant cost is usually considered to be the garbage collection required to deallocate it. Wouldn't the GC cost dwarf the zero-initialization cost?
Are you filling the arrays using some simple pattern? Maybe the JIT could detect this situation and avoid zero-initialization in such cases.
Would adding a method that returns an uninitialized array be enough? It could be e.g. Array.CreateUninitialized<T>().
Would using Span<T> or Memory<T> that is backed by a pointer to an uninitialized buffer (allocated using something like Marshal.AllocHGlobal()) instead of array work for you?

Span<T> is already released and converting pointers to it is trivial, but it has significant limitations due to being stack-only.

Memory<T> hasn't been released yet and, as far as I can tell, converting pointers to it requires fairly complicated custom class, but it's just a normal class, it has no stack-related limitations.

0 replies

MkazemAkhgary · 2017-10-24T03:55:53Z

MkazemAkhgary
Oct 24, 2017

You are talking about performance where you think zeroing it an initialization is problem?

You better move to c++ or write in c for gpu and take advantage of parallelism.

C# does not serve you very well.

0 replies

scalablecory · 2017-10-24T04:19:13Z

scalablecory
Oct 24, 2017

@svick no, I meant the package I linked to. The workload @msedi describes is exactly what the ArrayPool class in System.Buffers is designed for.

It will reduce GC pressure and has an option to not clear the arrays when you're done with them.

0 replies

msedi · 2017-10-24T07:57:02Z

msedi
Oct 24, 2017
Author

@svick: Thanks for the suggestions, I try to answer as best as I can:

Would avoiding zero-initialization actually significantly improve performance for you?

In fact I'm not sure, because there was no way to try. The newarr (MSIL) or new (C#) always do the initialization and I couldn't get around this. If someone has the option to try that would be great,

Are you filling the arrays using some simple pattern? Maybe the JIT could detect this situation and avoid zero-initialization in such cases.

No, indeed not. The pattern is completely random. To make it a little bit more clear, I'm taling about radiological images that either have a size of 512x512 or much higher, up 4096x4096. Since there is also random noise on it, Even if the baseline pattern doesn't change the noise still does.

Would adding a method that returns an uninitialized array be enough? It could be e.g. Array.CreateUninitialized().

Depends on question 1, If zeroing the array has a tremendous impact and calling the function hasn't that would be of course ok.

Would using Span or Memory that is backed by a pointer to an uninitialized buffer (allocated using something like Marshal.AllocHGlobal()) instead of array work for you?

As you already said, Span is created via stackalloc, but the memory size we are dealing are beyond the stack size. I was also thinking about the unmanaged Marshal.AllocHGlobal approach. But all our mathematical routines are based on conventional C# arrays. I would have to rewrite all array mathods to work with pointers also.

There comes another issue the fairly often comes up. There is currently not mechanism in C# to restrict to some numeric type. That's one reason why all my math/vector algebra is very very long code.

At least for some basic methods (like Add), I wrote my own Add implementation which is currently not typesafe due to the limitations that I'm not able to restrict the T as numeric type.

I will have a look at span and memory.

0 replies

msedi · 2017-10-24T08:01:43Z

msedi
Oct 24, 2017
Author

@MkazemAkhgary: We are already using CUDA and C++/MKL for some high performance things. But in my opinion some things can still be optimized in C#, since I'm not very happy moving to other languages only to a few small restrictions.

Also using CUDA is causing a lot of pain. Since the CUDA compilers are attached to some C++ compiler and to several platforms. With every new CUDA version we have to exchange GPUs at the customers site only because the compiler toolkits are not backward compatible.

With C#/.NET I can make sure that it works for a longer time.... ;-)

0 replies

msedi · 2017-10-24T08:02:21Z

msedi
Oct 24, 2017
Author

@scalablecory: Thanks. I will ahev a look at the ArrayPool and will return here if I have further details.

0 replies

msedi · 2017-10-24T08:50:02Z

msedi
Oct 24, 2017
Author

@svick: Do you know who is responsible for ArrayPool or where I can ask questions or have suggestions for the ArrayPool?

BTW: The ArrayPool always returns a multiple of some size and not the size I have requested. From the internals I understand the reason, But for the "external" user it would be helpful to have something returns like an ArraySegment or better a derivation of it. The only problem is that the ArraySegment is slow.

So maybe the Span or Memory might help?

0 replies

svick · 2017-10-24T10:05:28Z

svick
Oct 24, 2017
Collaborator

@msedi

In fact I'm not sure, because there was no way to try.

Have you tried profiling your code? I think a profiler should be able to tell you how much time is spent in GC, and compare it with how much time is spent allocating the arrays (which includes the time it takes to zero-initialize them).

As you already said, Span is created via stackalloc, but the memory size we are dealing are beyond the stack size.

No, Span<T> itself is stack-only, but it can point to a buffer that is not stack-allocated.

But all our mathematical routines are based on conventional C# arrays. I would have to rewrite all array mathods to work with pointers also.

I meant to avoid directly using pointers and instead rewrite your methods using Span<T> (or a combination of Memory<T> and Span<T>). Since Span<T> is similar to an array (e.g. it has a Length), that rewrite might not be that hard, unless you heavily use Array-specific framework methods).

Do you know who is responsible for ArrayPool or where I can ask questions or have suggestions for the ArrayPool?

That would be the corefx repo, since that's where the code is.

0 replies

msedi · 2017-10-24T10:43:51Z

msedi
Oct 24, 2017
Author

@svick : Thanks for the info. The only problem is that it seems Span and Memory are currently written in C# 7.2 or c# 8.0 which I cannot use in productive code, yet, right?

0 replies

msedi · 2020-10-19T10:48:19Z

msedi
Oct 19, 2020
Author

It seems that the problem is now solved with .NET5 and GC.AllocateUnitializedArray<T>.

0 replies

[Proposal] Allow to create arrays without initializing them to 0 #1037

Uh oh!

msedi Oct 23, 2017

Replies: 15 comments

Uh oh!

Joe4evr Oct 23, 2017

Uh oh!

msedi Oct 23, 2017 Author

Uh oh!

Joe4evr Oct 23, 2017

Uh oh!

scalablecory Oct 23, 2017

Uh oh!

svick Oct 23, 2017 Collaborator

Uh oh!

svick Oct 23, 2017 Collaborator

Uh oh!

MkazemAkhgary Oct 24, 2017

Uh oh!

scalablecory Oct 24, 2017

Uh oh!

msedi Oct 24, 2017 Author

Uh oh!

msedi Oct 24, 2017 Author

Uh oh!

msedi Oct 24, 2017 Author

Uh oh!

msedi Oct 24, 2017 Author

Uh oh!

svick Oct 24, 2017 Collaborator

Uh oh!

msedi Oct 24, 2017 Author

Uh oh!

msedi Oct 19, 2020 Author

msedi
Oct 23, 2017

Joe4evr
Oct 23, 2017

msedi
Oct 23, 2017
Author

Joe4evr
Oct 23, 2017

scalablecory
Oct 23, 2017

svick
Oct 23, 2017
Collaborator

svick
Oct 23, 2017
Collaborator

MkazemAkhgary
Oct 24, 2017

scalablecory
Oct 24, 2017

msedi
Oct 24, 2017
Author

msedi
Oct 24, 2017
Author

msedi
Oct 24, 2017
Author

msedi
Oct 24, 2017
Author

svick
Oct 24, 2017
Collaborator

msedi
Oct 24, 2017
Author

msedi
Oct 19, 2020
Author