Skip to content
GitLab
Menu
Projects
Groups
Snippets
Loading...
Help
Help
Support
Community forum
Keyboard shortcuts
?
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
Menu
Open sidebar
tsoc
openmm
Commits
435be5e2
Commit
435be5e2
authored
Jan 15, 2014
by
peastman
Browse files
Workaround for bug in Visual Studio
parent
c285b600
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
12 additions
and
4 deletions
+12
-4
openmmapi/include/openmm/internal/vectorize8.h
openmmapi/include/openmm/internal/vectorize8.h
+12
-4
No files found.
openmmapi/include/openmm/internal/vectorize8.h
View file @
435be5e2
...
@@ -9,7 +9,7 @@
...
@@ -9,7 +9,7 @@
* Biological Structures at Stanford, funded under the NIH Roadmap for *
* Biological Structures at Stanford, funded under the NIH Roadmap for *
* Medical Research, grant U54 GM072970. See https://simtk.org. *
* Medical Research, grant U54 GM072970. See https://simtk.org. *
* *
* *
* Portions copyright (c) 2013 Stanford University and the Authors.
*
* Portions copyright (c) 2013
-2014
Stanford University and the Authors. *
* Authors: Peter Eastman *
* Authors: Peter Eastman *
* Contributors: *
* Contributors: *
* *
* *
...
@@ -201,13 +201,21 @@ static inline void transpose(const fvec4& in1, const fvec4& in2, const fvec4& in
...
@@ -201,13 +201,21 @@ static inline void transpose(const fvec4& in1, const fvec4& in2, const fvec4& in
fvec4
i5
=
in5
,
i6
=
in6
,
i7
=
in7
,
i8
=
in8
;
fvec4
i5
=
in5
,
i6
=
in6
,
i7
=
in7
,
i8
=
in8
;
_MM_TRANSPOSE4_PS
(
i1
,
i2
,
i3
,
i4
);
_MM_TRANSPOSE4_PS
(
i1
,
i2
,
i3
,
i4
);
_MM_TRANSPOSE4_PS
(
i5
,
i6
,
i7
,
i8
);
_MM_TRANSPOSE4_PS
(
i5
,
i6
,
i7
,
i8
);
#ifdef _MSC_VER
// Visual Studio has a bug in _mm256_castps128_ps256, so we have to use the more expensive _mm256_insertf128_ps.
out1
=
_mm256_insertf128_ps
(
out1
,
i1
,
0
);
out2
=
_mm256_insertf128_ps
(
out2
,
i2
,
0
);
out3
=
_mm256_insertf128_ps
(
out3
,
i3
,
0
);
out4
=
_mm256_insertf128_ps
(
out4
,
i4
,
0
);
#else
out1
=
_mm256_castps128_ps256
(
i1
);
out1
=
_mm256_castps128_ps256
(
i1
);
out1
=
_mm256_insertf128_ps
(
out1
,
i5
,
1
);
out2
=
_mm256_castps128_ps256
(
i2
);
out2
=
_mm256_castps128_ps256
(
i2
);
out2
=
_mm256_insertf128_ps
(
out2
,
i6
,
1
);
out3
=
_mm256_castps128_ps256
(
i3
);
out3
=
_mm256_castps128_ps256
(
i3
);
out3
=
_mm256_insertf128_ps
(
out3
,
i7
,
1
);
out4
=
_mm256_castps128_ps256
(
i4
);
out4
=
_mm256_castps128_ps256
(
i4
);
#endif
out1
=
_mm256_insertf128_ps
(
out1
,
i5
,
1
);
out2
=
_mm256_insertf128_ps
(
out2
,
i6
,
1
);
out3
=
_mm256_insertf128_ps
(
out3
,
i7
,
1
);
out4
=
_mm256_insertf128_ps
(
out4
,
i8
,
1
);
out4
=
_mm256_insertf128_ps
(
out4
,
i8
,
1
);
}
}
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
.
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment