[stdlib_linalg] Add empty function. #477

zoziha · 2021-07-30T10:54:59Z

add empty function. (see numpy/empty)

Tasks

ready to allocatable solution. (see [stdlib_linalg] Add empty function. #477 (comment))

More routines to do, not this PR

see #476.

src/tests/linalg/test_linalg_empty.f90

doc/specs/stdlib_linalg.md

awvwgk · 2021-07-31T21:30:29Z

The main purpose of numpy.empty would be to create an allocation on the heap and return its reference. For a similar mechanism in Fortran we would have to use the pointer attribute, also we cannot assign in such case:

implicit none
real, pointer :: array(:)
integer :: n = 6000000

array => empty_ptr(n)
! ... do something with array
deallocate(array)  ! explicit free required
contains
   function empty_ptr(n) result(array)
      integer, intent(in) :: n
      real, pointer :: array(:)
      allocate(array(n))
   end function
end

The current implementation uses an automatic array to trigger an automatic LHS (re)allocation. This has several drawback, first the automatic array will most likely be created on the stack and cannot be moved to an allocatable array on the heap, but must be copied to the newly allocated array. This will be considerably worse than just using allocate. Also large stack arrays allocated by this mean are at risk of overflowing the stack, which leads to hard to debug stack overflows.

Preferably, to match the original intent of the functionality provided by numpy.empty to provide a functional access to the allocate procedure without introducing additional overhead, the implementation would be along the lines of:

function empty(n) result(array)
  integer, intent(in) :: n
  real, allocatable :: array(:)
  allocate(array(n))
end function empty

In the best case the compiler might move the allocation produced by the function to the LHS array instead of separately allocating and copying the uninitialized array.

zoziha · 2021-08-01T03:23:13Z

In the best case the compiler might move the allocation produced by the function to the LHS array instead of separately allocating and copying the uninitialized array.

Take a look at these two examples @awvwgk , I see from examples here, the current code implementation is more efficient in gfortran (not ifort), and the code implementation method you mentioned can also be (re)allocated.

Example 1

In the case of small stack usage, gfortran is more efficient and ifort is less efficient for the current implementation. (Um..)

program test

    real :: stime, etime
    real(kind=8), allocatable :: A(:,:), B(:,:)

    call cpu_time(stime)
    A = empty1(100,100)
    A = empty1(200,200)
    call cpu_time(etime)
    print *, "etime - stime (seconds) : ", etime - stime !! 3.4e-4 (gfortran); 1.5e-5 (ifx); 5.5e-4 (ifort)

    call cpu_time(stime)
    B = empty2(100,100)
    B = empty2(200,200)
    call cpu_time(etime)
    print *, "etime - stime (seconds) : ", etime - stime !! 8.0e-6 (gfortran); 4.3e-4 (ifx); 4.5e-4 (ifort)

contains

    pure function empty1(ndim1, ndim2) result(result)
        implicit none
        integer, intent(in) :: ndim1, ndim2
        real(kind=8), allocatable :: result(:,:)
        allocate(result(ndim1, ndim2))
    end function empty1

    pure function empty2(ndim1, ndim2) result(result)
        implicit none
        integer, intent(in) :: ndim1, ndim2
        real(kind=8) :: result(ndim1, ndim2)
    end function empty2

end program test

Example 2

In the case of large stack usage, gfortran is more efficient as well and ifort stack overflows for the current implementation. (Um.. +1)

program test

    real :: stime, etime
    real(kind=8), allocatable :: A(:,:), B(:,:)

    call cpu_time(stime)
    A = empty1(1000,1000)
    A = empty1(2000,2000)
    call cpu_time(etime)
    print *, "etime - stime (seconds) : ", etime - stime !! 3.3e-2 (gfortran); 2.2e-5 (ifx); 5.3e-2 (ifort)

    call cpu_time(stime)
    B = empty2(1000,1000)
    B = empty2(2000,2000)
    call cpu_time(etime)
    print *, "etime - stime (seconds) : ", etime - stime !! 1.6e-5 (gfortran); stack overflow (ifx/ifort) 

contains

    pure function empty1(ndim1, ndim2) result(result)
        implicit none
        integer, intent(in) :: ndim1, ndim2
        real(kind=8), allocatable :: result(:,:)
        allocate(result(ndim1, ndim2))
    end function empty1

    pure function empty2(ndim1, ndim2) result(result)
        implicit none
        integer, intent(in) :: ndim1, ndim2
        real(kind=8) :: result(ndim1, ndim2)
    end function empty2

end program test

zoziha · 2021-08-01T04:23:33Z

I think based on the above examples, to be compatible with ifort's compiler and maintain the robustness of empty, we use the allocatable solution seems to be a better choice.

1. automatic array -> allocatable array. 2. add `empty` tests for `real/complex` type.

milancurcic · 2021-08-22T17:11:00Z

doc/specs/stdlib_linalg.md

+program demo_linlag_empty_1
+
+    use stdlib_linlag, only: empty


Suggested change

program demo_linlag_empty_1

use stdlib_linlag, only: empty

program demo_linalg_empty_1

use stdlib_linalg, only: empty

milancurcic

Like with eye, I don't think that the allocatable result is a better choice here over the automatic array and the examples show it. However, the difference is probably low impact because this is likely to be used as a convenience function rather than a high-performance one, and in my view UX > performance. I think this can go forward as is.

awvwgk · 2021-08-22T17:56:06Z

The only use case I see for this function is to trigger an automatic LHS allocation, which I think is rather limited compared to the intrinsic function, because it will always require a statement to make use of this.

allocate(array(n, n))
array = empty(n, n)

Using it in an expression would only work when multiplying by zero, because the initial entries are undefined, but for this purpose we could use zeros instead. Also, in this context the result could change depending on compiler options, e.g. when requesting the compiler to initialize all reals with signalling NaNs.

I can see why this function is required in numpy to return a pointer to a memory allocation, but I don't think there is need for it in Fortran.

zoziha · 2021-08-23T13:56:33Z

I think the purpose of empty is indeed not clear at present, and I agree to cancel this PR.
We can just use allocate(array(m, n)) in Fortran.

zoziha mentioned this pull request Jul 30, 2021

[feature] Add more base routines for stdlib_linalg #476

Open

8 tasks

Add empty function.

e902823

zoziha commented Jul 30, 2021

View reviewed changes

src/tests/linalg/test_linalg_empty.f90 Outdated Show resolved Hide resolved

jvdp1 reviewed Jul 30, 2021

View reviewed changes

doc/specs/stdlib_linalg.md Show resolved Hide resolved

zoziha mentioned this pull request Jul 31, 2021

[stdlib_linalg] Add zeros, ones function. #478

Closed

2 tasks

awvwgk added the reviewers needed This patch requires extra eyes label Jul 31, 2021

zoziha added 2 commits August 3, 2021 10:43

Update empty function:

d2296a7

1. automatic array -> allocatable array. 2. add `empty` tests for `real/complex` type.

Improve empty func.

d57e386

zoziha mentioned this pull request Aug 18, 2021

[stdlib_linalg] Update eye function. #481

Merged

3 tasks

milancurcic reviewed Aug 22, 2021

View reviewed changes

milancurcic approved these changes Aug 22, 2021

View reviewed changes

milancurcic closed this Aug 23, 2021

awvwgk removed the reviewers needed This patch requires extra eyes label Sep 25, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[stdlib_linalg] Add empty function. #477

[stdlib_linalg] Add empty function. #477

zoziha commented Jul 30, 2021 •

edited

Loading

awvwgk commented Jul 31, 2021

zoziha commented Aug 1, 2021 •

edited

Loading

zoziha commented Aug 1, 2021

milancurcic Aug 22, 2021

milancurcic left a comment •

edited

Loading

awvwgk commented Aug 22, 2021

zoziha commented Aug 23, 2021 •

edited

Loading

[stdlib_linalg] Add empty function. #477

[stdlib_linalg] Add empty function. #477

Conversation

zoziha commented Jul 30, 2021 • edited Loading

Tasks

More routines to do, not this PR

awvwgk commented Jul 31, 2021

zoziha commented Aug 1, 2021 • edited Loading

Example 1

Example 2

zoziha commented Aug 1, 2021

milancurcic Aug 22, 2021

Choose a reason for hiding this comment

milancurcic left a comment • edited Loading

Choose a reason for hiding this comment

awvwgk commented Aug 22, 2021

zoziha commented Aug 23, 2021 • edited Loading

zoziha commented Jul 30, 2021 •

edited

Loading

zoziha commented Aug 1, 2021 •

edited

Loading

milancurcic left a comment •

edited

Loading

zoziha commented Aug 23, 2021 •

edited

Loading