BlockLib: a skeleton library for Cell Broadband Engine, IWMSE '08: Proceedings of the 1st International Workshop on Multicore Software Engineering, pp.7-14, 2008.
Entering the petaflop era: The architecture and performance of Roadrunner, 2008 SC, International Conference for High Performance Computing, Networking, Storage and Analysis, pp.1-11, 2008.
DOI : 10.1109/SC.2008.5217926
Steady-State Scheduling on Heterogeneous Clusters, International Journal of Foundations of Computer Science, vol.16, issue.02, pp.163-194, 2005.
DOI : 10.1142/S0129054105002930
URL : https://hal.archives-ouvertes.fr/inria-00358951
Steady-State Scheduling, Introduction to Scheduling, pp.159-186, 2010.
DOI : 10.1201/9781420072747-c7
URL : https://hal.archives-ouvertes.fr/inria-00344157
CellSs: Scheduling Techniques to Better Exploit Memory Hierarchy, Scientific Programming, pp.77-95, 2009.
DOI : 10.1155/2009/561672
URL : http://doi.org/10.1155/2009/561672
Distributed processing of very large datasets with DataCutter, Parallel Computing, vol.27, issue.11, pp.1457-1478, 2001.
DOI : 10.1016/S0167-8191(01)00099-0
High-performance software for mathematical programming and optimization.
Merrimac: Supercomputing with Streams, Proceedings of the 2003 ACM/IEEE conference on Supercomputing, SC '03, p.35, 2003.
DOI : 10.1145/1048935.1050187
Sequoia: Programming the Memory Hierarchy, Proceedings of the 2006 ACM/IEEE conference on Supercomputing, SC '06, p.83, 2006.
DOI : 10.1145/1188455.1188543
Efficient scheduling of task graph collections on heterogeneous resources, 2009 IEEE International Symposium on Parallel & Distributed Processing, 2009.
DOI : 10.1109/IPDPS.2009.5161045
URL : https://hal.archives-ouvertes.fr/hal-00786257
DTA-C: A Decoupled multi-Threaded Architecture for CMP Systems, 19th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD'07), pp.263-270, 2007.
DOI : 10.1109/SBAC-PAD.2007.27
Exploiting DMA to enable non-blocking execution in Decoupled Threaded Architecture, 2009 IEEE International Symposium on Parallel & Distributed Processing, 2009.
DOI : 10.1109/IPDPS.2009.5161111
A streaming computation framework for the Cell processor, Master's thesis, Massachusetts Institute of Technology, 2007.
A component-based framework for the Cell Broadband Engine, 2009 IEEE International Symposium on Parallel & Distributed Processing, 2009.
DOI : 10.1109/IPDPS.2009.5160919
Precedence-constrained task allocation onto point-to-point networks for pipelined execution, IEEE Transactions on Parallel and Distributed Systems, vol.10, issue.8, pp.838-851, 1999.
Introduction to the Cell multiprocessor, IBM Journal of Research and Development, vol.49, issue.4.5, pp.589-604, 2005.
DOI : 10.1147/rd.494.0589
Cell Multiprocessor Communication Network: Built for Speed, IEEE Micro, vol.26, issue.3, pp.10-23, 2006.
DOI : 10.1109/MM.2006.49
URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.161.8331
A NUMA API for Linux, 2005.
Partitioning and Scheduling Parallel Programs for Multiprocessors, 1989.
DAG generation program.
Language and Compiler Support for Stream Programs, 2001.
Self-Adaptive Configuration of Visualization Pipeline Over Wide-Area Networks, IEEE Transactions on Computers, vol.57, issue.1, pp.55-68, 2008.
DOI : 10.1109/TC.2007.70777