Problem deploying H-Store on AWS #152
Comments
That document looks out of date. You don't want to use the really old SVN repo. You want to use this GitHub one.
Yes, I tried the source from GitHub, but it still failed to build:
git clone git://github.com/apavlo/h-store.git
ee-build:
BUILD FAILED
Total time: 9 minutes 36 seconds
Thanks.
Is there an error from gcc? It's weird that it just fails like that.
I think I finally figured out the problem: the build runs out of memory. I was using a micro EC2 instance, which has only 0.6 GB of memory. I tried a medium instance and the build succeeded. Anyone who wants to try H-Store on AWS should use at least a medium-size EC2 instance. Thanks.
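A minimal sketch of a preflight check for the memory problem described above. It reads `MemTotal` from `/proc/meminfo` on Linux and refuses to start the build below a threshold. The function names and the 3072 MiB default are assumptions for illustration only; the thread establishes only that 0.6 GB was too little and a medium instance was enough.

```shell
# Print total RAM in MiB, read from a meminfo-style file (default: /proc/meminfo).
total_mem_mib() {
    awk '/^MemTotal:/ { printf "%d\n", $2 / 1024 }' "${1:-/proc/meminfo}"
}

# Return 0 if the machine has at least $1 MiB of RAM.
# The 3072 MiB default is an illustrative guess, not a documented H-Store requirement.
has_enough_memory() {
    [ "$(total_mem_mib "${2:-/proc/meminfo}")" -ge "${1:-3072}" ]
}

# Usage: only start the build when the check passes.
# has_enough_memory 3072 && ant build
```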
Now I can build it, but I am still unable to execute the benchmark on an AWS NFS cluster.
My 2 NFS cluster nodes are within the same security group.
I configured the SSH environment:
$ ssh localhost date
I copied my hstore.pem to the NFS server node with scp, changed the global.sshoptions parameter in $HSTORE_HOME/properties/default.properties, and created a cluster.txt.
No problem up to here.
$ ant hstore-benchmark -Dproject=tpcc
hstore-benchmark:
benchmark:
BUILD FAILED
Total time: 15 seconds
I didn't see any useful log output from these 2 nodes:
2014-01-29T00:58:59.895.0
:~/h-store/obj/logs/sites$ cat site-01-host1.ip-172-31-xx-xxx.eu-west-1.compute.internal.log
2014-01-29T00:58:59.971.0
Any advice? Thanks!
Use the internal IP addresses instead of the public ones.
Yes, I am using the AWS internal DNS names, as you can see in my cluster.txt, and internal IPs for the NFS cluster, but it just can't execute. Do you mean I should use internal IP addresses instead of internal DNS names in cluster.txt? So like this? Thanks.
Enable DEBUG for 'org/voltdb/processtools/ProcessSetManager.java' in log4j.properties. See what the SSH command is that it's trying to use to start the sites, and see whether you can fire them off by hand.
Andy Pavlo
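A hedged sketch of how that setting might be written. In log4j 1.x properties files a logger is configured with a line of the form `log4j.logger.<name>=<LEVEL>`, and loggers are conventionally named after the fully qualified class. Assuming H-Store follows that convention, the source path above maps to a dotted logger name. The helper names below are hypothetical, and the path to log4j.properties is left as a parameter because the thread does not state it.

```shell
# Convert a Java source path to the dotted class name that log4j
# conventionally uses as the logger name.
logger_name_from_path() {
    printf '%s\n' "${1%.java}" | tr '/' '.'
}

# Append a DEBUG entry for the given source path to a log4j 1.x properties file.
# $1 = properties file, $2 = source path such as org/voltdb/.../Foo.java
enable_debug_logger() {
    printf 'log4j.logger.%s=DEBUG\n' "$(logger_name_from_path "$2")" >> "$1"
}

# Usage (the properties file location is an assumption):
# enable_debug_logger ./log4j.properties org/voltdb/processtools/ProcessSetManager.java
```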
Sorry, I am not sure I'm completely following you. I changed the VoltDB section to DEBUG in log4j.properties:
VoltDB Stuff
log4j.logger.org.voltdb.VoltProcedure=DEBUG
After running ant hstore-prepare -Dproject=tpcc -Dhosts=/home/ubuntu/cluster.txt, I still get:
hstore-benchmark:
benchmark:
BUILD FAILED
Total time: 11 seconds
I checked the log and it still has no useful info:
$ cat site-01-host1.ip-172-31-xx-xx.eu-west-1.compute.internal.log
2014-01-29T03:16:24.778.0
Thanks.
Hi Andy, I checked ProcessSetManager.java. Does it use the "ping" command to create the process?
I opened the ICMP port in the security group but was still unable to execute the benchmark. Then I opened ALL traffic ports to all IPs in this security group, so I think that no matter what commands H-Store uses, there should be no problem within the security group. But it still fails to execute the benchmark:
BUILD FAILED
Total time: 50 seconds
There is nothing in these two logs except a date:
2014-01-29T03:32:26.251.0
~/h-store/obj/logs/sites$ cat site-01-host1.ip-172-31-xx-x.eu-west-1.compute.internal.log
2014-01-29T22:23:22.698.0
I suspect cluster.txt: is it in the right format? Any further advice will be appreciated. Thanks.
Add this to the bottom of log4j.properties:
Run the benchmark with this turned on, then check the site log to look for the SSH command that it's trying to send over the wire. Then copy and paste that command in a terminal to check whether it works.
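Before pasting the full command from the log, it can help to confirm that a plain non-interactive SSH login works from the node that launches the benchmark to every other node. The sketch below is an assumption-laden illustration: `check_ssh_hosts` and `SSH_BIN` are hypothetical names, the hosts are passed as arguments rather than parsed from cluster.txt (whose format is not shown in this thread), and `date` is used as the remote command as in the earlier `ssh localhost date` test. The `-i`, `-o BatchMode=yes`, and `-o ConnectTimeout=5` options are standard OpenSSH client options.

```shell
# Try a non-interactive SSH login to each host and report the result.
# $1 is the private key file; the remaining arguments are hostnames or IPs.
# SSH_BIN may be overridden (for example with "true") for a dry run.
check_ssh_hosts() {
    key_file="$1"
    shift
    for host in "$@"; do
        # BatchMode=yes makes ssh fail instead of prompting for a password.
        if ${SSH_BIN:-ssh} -i "$key_file" -o BatchMode=yes -o ConnectTimeout=5 \
                "$host" date >/dev/null 2>&1; then
            echo "OK   $host"
        else
            echo "FAIL $host"
        fi
    done
}

# Usage (key path taken from the thread, host names are placeholders):
# check_ssh_hosts /home/ubuntu/.ssh/hstore.pem host1 host2
```

A FAIL line here points at keys, usernames, or security groups rather than at H-Store itself.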
Yes, I added it, copied the SSH command, and ran it by hand; it failed. I checked the source code that connects to the remote nodes and have two questions:
Does the SSH login username affect the connection?
The autofs part is set as
which automatically syncs all folders and files under /home/. But when I set up the SSH environment on each NFS server and client, autofs automatically syncs each key to all the others, which means I can only run on one EC2 instance. So should I rewrite my auto.home file so that it does not sync everything? I ask because the document specifically mentions a requirement for the directory, but it seems to conflict with the SSH environment configuration. Could you advise? Thanks.
On 29 Jan 2014 22:48, "Andy Pavlo" notifications@github.com wrote:
Hi Andy, this time I changed every EC2 instance's hostname to match cluster.txt and mounted only the h-store folder, instead of /home/&, within the NFS cluster. I added this line to log4j.properties:
and ran the SSH command by hand. It returns "Unable to set CPU affinity.." and "Insufficient number of cores", so it disables the transaction pre/post processing threads, and the connection and execution fail. But I can execute the H-Store benchmark on a single large EC2 instance without any problem. I built this NFS cluster on AWS from 3 identical large EC2 instances, yet it reports an insufficient number of cores. Is H-Store a sharded NoSQL system in which each node is isolated from the others? Shouldn't it need fewer system resources per node if I run this benchmark on a cluster instead of a single machine? Would you give me some clues, please? Thanks!
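One way to compare the nodes is to check how many CPU cores each one actually reports. The sketch below uses `nproc` from GNU coreutils. The function name, the `CORE_COUNT` override, and any threshold passed in are hypothetical: this thread does not state how many cores H-Store expects per site or partition.

```shell
# Return 0 if this machine has at least $1 online CPU cores.
# CORE_COUNT can be preset for a dry run; otherwise nproc is queried.
has_enough_cores() {
    required="$1"
    available="${CORE_COUNT:-$(nproc)}"
    [ "$available" -ge "$required" ]
}

# Usage: the required count of 4 is only a placeholder.
# has_enough_cores 4 || echo "only $(nproc) cores on $(hostname)"
```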
Hi Andy,
I followed the steps in the document about running on EC2, as listed below, but ant build failed:
sudo vim /etc/apt/sources.list
deb http://archive.canonical.com/ubuntu lucid partner
deb-src http://archive.canonical.com/ubuntu lucid partner
sudo apt-get update
Package sun-java6-jdk is not available, so I changed it to openjdk-6-jdk:
sudo apt-get --yes install subversion gcc g++ make openjdk-6-jdk valgrind ant
svn co https://database.cs.brown.edu/svn/hstore/trunk/ $HSTORE_HOME
cp hstore.pem ~/.ssh/ && chmod 400 ~/.ssh/hstore.pem
vim trunk/properties/default.properties
global.sshoptions = -i /home/ubuntu/.ssh/hstore.pem
ant build
ee:
BUILD FAILED
/home/ubuntu/trunk/build.xml:715: exec returned: 137
Because ant build failed on the SVN checkout, I removed it and tried the source from git:
sudo rm -r trunk/
sudo apt-get install git
git clone git://github.com/apavlo/h-store.git
ant build
ee-build:
[exec] make: Entering directory `/home/ubuntu/h-store/obj/release'
[exec] g++ -Wall -Wextra -Werror -Woverloaded-virtual -Wconversion -Wpointer-arith -Wcast-qual -Wcast-align -Wwrite-strings -Winit-self -Wno-sign-compare -Wno-unused-parameter -pthread -D__STDC_CONSTANT_MACROS -D__STDC_LIMIT_MACROS -DNOCLOCK -fno-omit-frame-pointer -fvisibility=hidden -DBOOST_SP_DISABLE_THREADS -Wno-ignored-qualifiers -fno-strict-aliasing -Wno-attributes -DLINUX -fPIC -Wno-unused-but-set-variable -DANTICACHE -DANTICACHE_REVERSIBLE_LRU -isystem ../../third_party/cpp -isystem ../../obj/release/berkeleydb -I../../src/ee -c -g3 -O3 -mmmx -msse -msse2 -msse3 -DNDEBUG -DVOLT_LOG_LEVEL=500 -o objects//voltdbjni.co ../../src/ee//voltdbjni.cpp
BUILD FAILED
/home/ubuntu/h-store/build.xml:860: exec returned: 137
Total time: 9 minutes 36 seconds
Any help will be greatly appreciated!
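A note on the `exec returned: 137` lines in both build logs: by shell convention, an exit status above 128 means the process was terminated by a signal, with the signal number equal to the status minus 128. A small sketch of that arithmetic follows; the function name is hypothetical. Status 137 corresponds to signal 9 (SIGKILL), which is consistent with the Linux out-of-memory killer stopping the compiler, though the logs alone do not prove that cause.

```shell
# Print the signal number encoded in an exit status above 128,
# or print nothing for an ordinary exit status.
signal_from_exit_status() {
    if [ "$1" -gt 128 ]; then
        echo $(( $1 - 128 ))
    fi
}

# Usage:
# signal_from_exit_status 137
# The kernel log may show whether the OOM killer was involved:
# dmesg | grep -i 'killed process'
```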