
GoboLinux Recipe & Package Search Tool

71 versions of Linux.

Program  Age  Size  By  WWW  Summary
Linux 4.13.2-r1 925  88794 Luca...
The Linux Kernel.
Linux 4.9.16-r3 1096  100651 Luca...
The Linux Kernel.
Linux 4.9.4-r5 1152  99374 Luca...
The Linux Kernel.
Linux 4.8.2-r2 1218  89394 Luca...
The Linux Kernel.
Linux 4.7.4-r1 1288  82767 Luca...
The Linux Kernel.
Linux 4.7.0-r4 1343  82585 Luca...
The Linux Kernel.
Linux 3.13.3-r1 2223  252629 Luca...
The Linux Kernel.
Linux 3.12.6-r1 2283  238949 Luca...
The Linux Kernel.
Linux 3.9.4-r2 2478  70048 Luca...
The Linux Kernel.
Linux 3.7.1-r2 2651  67579 Luca...
The Linux Kernel.
Linux 3.5.0-r1 2802  124391 Luca...
The Linux Kernel.
Linux 3.4.4-r1 2802  124348 Luca...
The Linux Kernel.
Linux 3.3.6-r1 2802  124410 Luca...
The Linux Kernel.
Linux 3.2.12-r2 2930  124345 Luca...
The Linux Kernel.
Linux 3.2.7-r1 2960  123550 Mich...
The Linux Kernel.
Linux 3.1.1-r1 3065  122907 Mich...
The Linux Kernel.
Linux 3.0.4-r4 3126  122754 Luca...
The Linux Kernel.
Linux 2.6.36.3-r1 3349  116087 Diog...
The Linux Kernel.
Linux 2.6.32.3-r1 3727  117990 Luca...
The Linux Kernel.
Linux 2.6.32-r1 3768  117751 Luca...
The Linux Kernel.
Linux 2.6.31.6-r3 3773  126499 Luca...
The Linux Kernel.
Linux 2.6.30.5-r1 3852  166102 Jona...
The Linux Kernel.
Linux 2.6.29.1-r1 3995  117500 Luca...
The Linux Kernel.
Linux 2.6.28.7-r1 4051  115518 Giam...
The Linux Kernel.
Linux 2.6.28.1-r1 4051  115487 Giam...
The Linux Kernel.
Linux 2.6.28-r1 4051  116681 Mich...
The Linux Kernel.
Linux 2.6.27.8-r1 4051  134160 Giam...
The Linux Kernel.
01-gobohide.patch
02-unionfs-2.5_for_2.6.27-rc6.patch
04-thinkpad-acpi-0.21-20081019_v2.6.27.2.patch
05-applesmc-accel-create-check.patch
06-appletouch.patch
07-applesmc-retry-when-accessing-keys.patch
08-applesmc-remove-debugging-messages.patch
09-applesmc_int.patch
10-decrypt_failed_as_debug.patch
Recipe
Resources/BuildInformation
Resources/Dependencies
Resources/Description
cell/01-copy4k-cell.patch
cell/02-memcpy-cell.patch
cell/Recipe
cell/dot-config
i686/01-acpi-dsdt-initrd-v0.9c-2.6.26.patch
i686/Recipe
i686/dot-config
ppc/Recipe
ppc/dot-config
x86_64/Recipe
x86_64/dot-config
Add a new CPU feature bit, CPU_FTR_CP_USE_DCBTZ, to be added to the
64bit powerpc chips that benefit from having dcbt and dcbz
instructions used in their memory copy routines.

This will be used in a subsequent patch that updates copy_4K_page().
The new bit is added to Cell, PPC970 and Power4 because they show
better performance with the new copy_4K_page() when dcbt and dcbz
instructions are used.
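
As a rough illustration of how such a feature bit can be tested, here is a minimal C sketch. The four bit values are copied from the cputable.h hunk of this patch; the plain ULL constants (instead of LONG_ASM_CONST) and the helpers has_feature() and uses_dcbtz_setup() are assumptions made for this example and are not part of the kernel.

```c
#include <stdbool.h>
#include <stdint.h>

/* Bit values as they appear in the cputable.h hunk of this patch. */
#define CPU_FTR_NO_SLBIE_B   0x0008000000000000ULL
#define CPU_FTR_VSX          0x0010000000000000ULL
#define CPU_FTR_SAO          0x0020000000000000ULL
#define CPU_FTR_CP_USE_DCBTZ 0x0040000000000000ULL

/* Hypothetical helper: true when every bit of `feature` is set in `cpu_features`. */
static bool has_feature(uint64_t cpu_features, uint64_t feature)
{
    return (cpu_features & feature) == feature;
}

/* Hypothetical helper: would a chip with this mask use dcbt/dcbz in its copy routine? */
static bool uses_dcbtz_setup(uint64_t cpu_features)
{
    return has_feature(cpu_features, CPU_FTR_CP_USE_DCBTZ);
}
```

In this sketch a mask that ORs in CPU_FTR_CP_USE_DCBTZ (as the patch does for Cell, PPC970 and Power4) makes uses_dcbtz_setup() return true, and any mask without it returns false.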

Signed-off-by: Mark Nelson <markn at au1.ibm.com>
---
 arch/powerpc/include/asm/cputable.h |    9 ++++++---
 1 file changed, 6 insertions(+), 3 deletions(-)

Index: upstream/arch/powerpc/include/asm/cputable.h
===================================================================
--- upstream.orig/arch/powerpc/include/asm/cputable.h
+++ upstream/arch/powerpc/include/asm/cputable.h
@@ -192,6 +192,7 @@ extern const char *powerpc_base_platform
 #define CPU_FTR_NO_SLBIE_B		LONG_ASM_CONST(0x0008000000000000)
 #define CPU_FTR_VSX			LONG_ASM_CONST(0x0010000000000000)
 #define CPU_FTR_SAO			LONG_ASM_CONST(0x0020000000000000)
+#define CPU_FTR_CP_USE_DCBTZ		LONG_ASM_CONST(0x0040000000000000)
 
 #ifndef __ASSEMBLY__
 
@@ -387,10 +388,11 @@ extern const char *powerpc_base_platform
 	    CPU_FTR_MMCRA | CPU_FTR_CTRL)
 #define CPU_FTRS_POWER4	(CPU_FTR_USE_TB | CPU_FTR_LWSYNC | \
 	    CPU_FTR_HPTE_TABLE | CPU_FTR_PPCAS_ARCH_V2 | CPU_FTR_CTRL | \
-	    CPU_FTR_MMCRA)
+	    CPU_FTR_MMCRA | CPU_FTR_CP_USE_DCBTZ)
 #define CPU_FTRS_PPC970	(CPU_FTR_USE_TB | CPU_FTR_LWSYNC | \
 	    CPU_FTR_HPTE_TABLE | CPU_FTR_PPCAS_ARCH_V2 | CPU_FTR_CTRL | \
-	    CPU_FTR_ALTIVEC_COMP | CPU_FTR_CAN_NAP | CPU_FTR_MMCRA)
+	    CPU_FTR_ALTIVEC_COMP | CPU_FTR_CAN_NAP | CPU_FTR_MMCRA | \
+	    CPU_FTR_CP_USE_DCBTZ)
 #define CPU_FTRS_POWER5	(CPU_FTR_USE_TB | CPU_FTR_LWSYNC | \
 	    CPU_FTR_HPTE_TABLE | CPU_FTR_PPCAS_ARCH_V2 | CPU_FTR_CTRL | \
 	    CPU_FTR_MMCRA | CPU_FTR_SMT | \
@@ -411,7 +413,8 @@ extern const char *powerpc_base_platform
 #define CPU_FTRS_CELL	(CPU_FTR_USE_TB | CPU_FTR_LWSYNC | \
 	    CPU_FTR_HPTE_TABLE | CPU_FTR_PPCAS_ARCH_V2 | CPU_FTR_CTRL | \
 	    CPU_FTR_ALTIVEC_COMP | CPU_FTR_MMCRA | CPU_FTR_SMT | \
-	    CPU_FTR_PAUSE_ZERO | CPU_FTR_CI_LARGE_PAGE | CPU_FTR_CELL_TB_BUG)
+	    CPU_FTR_PAUSE_ZERO | CPU_FTR_CI_LARGE_PAGE | \
+	    CPU_FTR_CELL_TB_BUG | CPU_FTR_CP_USE_DCBTZ)
 #define CPU_FTRS_PA6T (CPU_FTR_USE_TB | CPU_FTR_LWSYNC | \
 	    CPU_FTR_HPTE_TABLE | CPU_FTR_PPCAS_ARCH_V2 | \
 	    CPU_FTR_ALTIVEC_COMP | CPU_FTR_CI_LARGE_PAGE | \

This new copy_4K_page() function was originally tuned for the best
performance on the Cell processor, but after testing on more 64bit
powerpc chips it was found that with a small modification it either
matched the performance offered by the current mainline version or
bettered it by a small amount.

It was found that on a Cell-based QS22 blade the amount of system
time measured when compiling a 2.6.26 pseries_defconfig decreased
by 4%. Using the same test, a 4-way 970MP machine saw a decrease of
2% in system time. No noticeable change was seen on Power4, Power5
or Power6.

The 4096 byte page is copied in thirty-two 128 byte strides. An
initial setup loop executes dcbt instructions for the whole source
page and dcbz instructions for the whole destination page. To do
this, the cache line size is retrieved from ppc64_caches.

A new CPU feature bit, CPU_FTR_CP_USE_DCBTZ, (introduced in the
previous patch) is used to make the modification to this new copy
routine - on Power4, 970 and Cell the feature bit is set so the
setup loop is executed, but on all other 64bit chips the setup
loop is nop'ed out.
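
The arithmetic behind the stride count and the setup loop can be sketched in portable C. This is only a model under stated assumptions, not the kernel routine: the names setup_iterations(), stride_count() and copy_4k_page_model() are hypothetical, memcpy() stands in for the unrolled ld/std pairs, and the dcbt/dcbz cache hints have no portable equivalent, so only their iteration count is modelled.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

#define PAGE_BYTES   4096u  /* page size handled by copy_4K_page */
#define STRIDE_BYTES 128u   /* bytes moved per stride */

/* Setup-loop trip count: one dcbt/dcbz pair per cache line,
 * i.e. 4096 >> log2(cache line size), as in "srd r8,r5,r11". */
static unsigned setup_iterations(unsigned log2_line_size)
{
    return PAGE_BYTES >> log2_line_size;
}

/* Number of 128-byte strides in a page, as in "srdi r8,r5,7". */
static unsigned stride_count(void)
{
    return PAGE_BYTES >> 7;
}

/* Portable model of the copy: thirty-two strides of 128 bytes each.
 * The assembly runs 31 strides in the loop and one stride after it;
 * this model does not reproduce that split. */
static void copy_4k_page_model(void *dst, const void *src)
{
    unsigned char *d = dst;
    const unsigned char *s = src;

    for (unsigned i = 0; i < stride_count(); i++)
        memcpy(d + (size_t)i * STRIDE_BYTES,
               s + (size_t)i * STRIDE_BYTES,
               STRIDE_BYTES);
}
```

With a 128-byte cache line (log2 = 7) the setup loop would run 32 times, and with a 64-byte line (log2 = 6) it would run 64 times; the stride count stays at 32 either way.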

Signed-off-by: Mark Nelson <markn at au1.ibm.com>
---
 arch/powerpc/lib/copypage_64.S |  198 +++++++++++++++++++----------------------
 1 file changed, 93 insertions(+), 105 deletions(-)

Index: upstream/arch/powerpc/lib/copypage_64.S
===================================================================
--- upstream.orig/arch/powerpc/lib/copypage_64.S
+++ upstream/arch/powerpc/lib/copypage_64.S
@@ -1,5 +1,5 @@
 /*
- * Copyright (C) 2002 Paul Mackerras, IBM Corp.
+ * Copyright (C) 2008 Mark Nelson, IBM Corp.
  *
  * This program is free software; you can redistribute it and/or
  * modify it under the terms of the GNU General Public License
@@ -8,112 +8,100 @@
  */
 #include <asm/processor.h>
 #include <asm/ppc_asm.h>
+#include <asm/asm-offsets.h>
+
+        .section        ".toc","aw"
+PPC64_CACHES:
+        .tc             ppc64_caches[TC],ppc64_caches
+        .section        ".text"
+
 
 _GLOBAL(copy_4K_page)
-	std	r31,-8(1)
-	std	r30,-16(1)
-	std	r29,-24(1)
-	std	r28,-32(1)
-	std	r27,-40(1)
-	std	r26,-48(1)
-	std	r25,-56(1)
-	std	r24,-64(1)
-	std	r23,-72(1)
-	std	r22,-80(1)
-	std	r21,-88(1)
-	std	r20,-96(1)
-	li	r5,4096/32 - 1
+	li	r5,4096		/* 4K page size */
+BEGIN_FTR_SECTION
+	ld      r10,PPC64_CACHES@toc(r2)
+	lwz	r11,DCACHEL1LOGLINESIZE(r10)	/* log2 of cache line size */
+	lwz     r12,DCACHEL1LINESIZE(r10)	/* get cache line size */
+	li	r9,0
+	srd	r8,r5,r11
+
+	mtctr	r8
+setup:
+	dcbt	r9,r4
+	dcbz	r9,r3
+	add	r9,r9,r12
+	bdnz	setup
+END_FTR_SECTION_IFSET(CPU_FTR_CP_USE_DCBTZ)
 	addi	r3,r3,-8
-	li	r12,5
-0:	addi	r5,r5,-24
-	mtctr	r12
-	ld	r22,640(4)
-	ld	r21,512(4)
-	ld	r20,384(4)
-	ld	r11,256(4)
-	ld	r9,128(4)
-	ld	r7,0(4)
-	ld	r25,648(4)
-	ld	r24,520(4)
-	ld	r23,392(4)
-	ld	r10,264(4)
-	ld	r8,136(4)
-	ldu	r6,8(4)
-	cmpwi	r5,24
-1:	std	r22,648(3)
-	std	r21,520(3)
-	std	r20,392(3)
-	std	r11,264(3)
-	std	r9,136(3)
-	std	r7,8(3)
-	ld	r28,648(4)
-	ld	r27,520(4)
-	ld	r26,392(4)
-	ld	r31,264(4)
-	ld	r30,136(4)
-	ld	r29,8(4)
-	std	r25,656(3)
-	std	r24,528(3)
-	std	r23,400(3)
-	std	r10,272(3)
-	std	r8,144(3)
-	std	r6,16(3)
-	ld	r22,656(4)
-	ld	r21,528(4)
-	ld	r20,400(4)
-	ld	r11,272(4)
-	ld	r9,144(4)
-	ld	r7,16(4)
-	std	r28,664(3)
-	std	r27,536(3)
-	std	r26,408(3)
-	std	r31,280(3)
-	std	r30,152(3)
-	stdu	r29,24(3)
-	ld	r25,664(4)
-	ld	r24,536(4)
-	ld	r23,408(4)
-	ld	r10,280(4)
-	ld	r8,152(4)
-	ldu	r6,24(4)
+	srdi    r8,r5,7		/* page is copied in 128 byte strides */
+	addi	r8,r8,-1	/* one stride copied outside loop */
+
+	mtctr	r8
+
+	ld	r5,0(r4)
+	ld	r6,8(r4)
+	ld	r7,16(r4)
+	ldu	r8,24(r4)
+1:	std	r5,8(r3)
+	ld	r9,8(r4)
+	std	r6,16(r3)
+	ld	r10,16(r4)
+	std	r7,24(r3)
+	ld	r11,24(r4)
+	std	r8,32(r3)
+	ld	r12,32(r4)
+	std	r9,40(r3)
+	ld	r5,40(r4)
+	std	r10,48(r3)
+	ld	r6,48(r4)
+	std	r11,56(r3)
+	ld	r7,56(r4)
+	std	r12,64(r3)
+	ld	r8,64(r4)
+	std	r5,72(r3)
+	ld	r9,72(r4)
+	std	r6,80(r3)
+	ld	r10,80(r4)
+	std	r7,88(r3)
+	ld	r11,88(r4)
+	std	r8,96(r3)
+	ld	r12,96(r4)
+	std	r9,104(r3)
+	ld	r5,104(r4)
+	std	r10,112(r3)
+	ld	r6,112(r4)
+	std	r11,120(r3)
+	ld	r7,120(r4)
+	stdu	r12,128(r3)
+	ldu	r8,128(r4)
 	bdnz	1b
-	std	r22,648(3)
-	std	r21,520(3)
-	std	r20,392(3)
-	std	r11,264(3)
-	std	r9,136(3)
-	std	r7,8(3)
-	addi	r4,r4,640
-	addi	r3,r3,648
-	bge	0b
-	mtctr	r5
-	ld	r7,0(4)
-	ld	r8,8(4)
-	ldu	r9,16(4)
-3:	ld	r10,8(4)
-	std	r7,8(3)
-	ld	r7,16(4)
-	std	r8,16(3)
-	ld	r8,24(4)
-	std	r9,24(3)
-	ldu	r9,32(4)
-	stdu	r10,32(3)
-	bdnz	3b
-4:	ld	r10,8(4)
-	std	r7,8(3)
-	std	r8,16(3)
-	std	r9,24(3)
-	std	r10,32(3)
-9:	ld	r20,-96(1)
-	ld	r21,-88(1)
-	ld	r22,-80(1)
-	ld	r23,-72(1)
-	ld	r24,-64(1)
-	ld	r25,-56(1)
-	ld	r26,-48(1)
-	ld	r27,-40(1)
-	ld	r28,-32(1)
-	ld	r29,-24(1)
-	ld	r30,-16(1)
-	ld	r31,-8(1)
+
+	std	r5,8(r3)
+	ld	r9,8(r4)
+	std	r6,16(r3)
+	ld	r10,16(r4)
+	std	r7,24(r3)
+	ld	r11,24(r4)
+	std	r8,32(r3)
+	ld	r12,32(r4)
+	std	r9,40(r3)
+	ld	r5,40(r4)
+	std	r10,48(r3)
+	ld	r6,48(r4)
+	std	r11,56(r3)
+	ld	r7,56(r4)
+	std	r12,64(r3)
+	ld	r8,64(r4)
+	std	r5,72(r3)
+	ld	r9,72(r4)
+	std	r6,80(r3)
+	ld	r10,80(r4)
+	std	r7,88(r3)
+	ld	r11,88(r4)
+	std	r8,96(r3)
+	ld	r12,96(r4)
+	std	r9,104(r3)
+	std	r10,112(r3)
+	std	r11,120(r3)
+	std	r12,128(r3)
 	blr

Linux 2.6.27.4-r3 4051  149529 Luca...
The Linux Kernel.
Linux 2.6.25.17-r1 4051  172834 Giam...
The Linux Kernel.
Linux 2.6.25.16-r1 4051  166500 Giam...
The Linux Kernel.
Linux 2.6.25.10-r2 4051  165320 Giam...
The Linux Kernel.
Linux 2.6.25.7-r1 4051  157294 Giam...
The Linux Kernel.
Linux 2.6.25.4-r1 4051  133017 Hopp...
The Linux Kernel.
Linux 2.6.25-r1 4051  133216 Luca...
The Linux Kernel.
Linux 2.6.24.4-r5 4051  150733
The Linux Kernel.
Linux 2.6.24.3-r5 4051  150221 Luca...
The Linux Kernel.
Linux 2.6.24.2-r3 4051  146488 Giam...
The Linux Kernel.
Linux 2.6.24.1-r1 4051  146454 Giam...
The Linux Kernel.
Linux 2.6.24-r1 4051  146428 Luca...
The Linux Kernel.
Linux 2.6.23.8-r4 4051  155842 Luca...
The Linux Kernel.
Linux 2.6.22.7-r4 4051  114727 Luca...
The Linux Kernel.
Linux 2.6.22.1-r1 4051  121391 Luca...
The Linux Kernel.
Linux 2.6.21.1-r3 4051  118854 Luca...
The Linux Kernel.
Linux 2.6.20.7-r1 4051  117945 Luca...
The Linux Kernel.
Linux 2.6.20.4-r3 4051  151150 Luca...
The Linux Kernel.
Linux 2.6.20-r1 4051  106429 Luca...
The Linux Kernel.
Linux 2.6.18.3-r2 4051  111124 Luca...
The Linux Kernel.
Linux 2.6.17.11-r1 4051  164053 Luca...
The Linux Kernel.
Linux 2.6.17.3-r1 4051  165067 Luca...
The Linux Kernel.
Linux 2.6.16.20-r1 4051  133625 Luca...
The Linux Kernel.
Linux 2.6.16.14-r1 4051  168270 Luca...
The Linux Kernel.
Linux 2.6.15.5-r1 4051  224686 Jona...
The Linux Kernel.
Linux 2.6.15.2-r1 4051  177165 Carl...
The Linux Kernel.
Linux 2.6.15.1-r1 4051  149219 Jona...
The Linux Kernel.
Linux 2.6.15-r1 4051  149214 Luca...
The Linux Kernel.
Linux 2.6.14.4-r1 4051  150166 Jona...
The Linux Kernel.
Linux 2.6.14.3-r1 4051  150060 Jona...
The Linux Kernel.
Linux 2.6.14.2-r1 4051  149791 Carl...
The Linux Kernel.
Linux 2.6.13.4-r1 4051  149559 Luca...
The Linux Kernel.
Linux 2.6.13.2-r1 4051  56611 Jona...
The Linux Kernel.
Linux 2.6.13.1-r1 4051  56378 Luca...
The Linux Kernel.
Linux 2.6.12.2-r1 4051  50355 Luca...
The Linux Kernel.
Linux 2.6.11.9-r1 4051  98969 Luca...
The Linux Kernel.
Linux 2.6.11.8-r1 4051  109424 Jona...
The Linux Kernel.
Linux 2.6.11-r1 4051  99032 Luca...
The Linux Kernel.
Linux 2.6.10-r1 4051  65969 Luca...
The Linux Kernel.
Linux 2.6.9-r1 4051  278461 Luca...
The Linux Kernel.
Linux 2.6.8.1-r1 4051  35576 Luca...
The Linux Kernel.
Linux 2.6.7-r1 4051  22610 Luca...
The Linux Kernel.
Linux 2.6.6-r1 4051  21958 Luca...
The Linux Kernel.
Linux 2.4.26-r1 4051  22359 Luca...
The Linux Kernel.