DAMASK_EICMD/processing/post/binXY.py

#!/usr/bin/env python3

import os
import sys
from optparse import OptionParser

import numpy as np

import damask


scriptName = os.path.splitext(os.path.basename(__file__))[0]
scriptID   = ' '.join([scriptName,damask.version])


# --------------------------------------------------------------------
#                                MAIN
# --------------------------------------------------------------------

parser = OptionParser(option_class=damask.extendableOption, usage='%prog options [ASCIItable(s)]', description = """
Produces a binned grid of two columns from an ASCIItable, i.e. a two-dimensional probability density map.

""", version = scriptID)

parser.add_option('-d','--data',
                  dest = 'data',
                  type = 'string', nargs = 2, metavar = 'string string',
                  help = 'column labels containing x and y')
parser.add_option('-w','--weight',
                  dest = 'weight',
                  type = 'string', metavar = 'string',
                  help = 'column label containing weight of (x,y) point')
parser.add_option('-b','--bins',
                  dest = 'bins',
                  type = 'int', nargs = 2, metavar = 'int int',
                  help = 'number of bins in x and y direction [%default]')
parser.add_option('-t','--type',
                  dest = 'type',
                  type = 'string', nargs = 3, metavar = 'string string string',
                  help = 'type (linear/log) of x, y, and z axis [%default]')
parser.add_option('-x','--xrange',
                  dest = 'xrange',
                  type = 'float', nargs = 2, metavar = 'float float',
                  help = 'min max limits in x direction (optional)')
parser.add_option('-y','--yrange',
                  dest = 'yrange',
                  type = 'float', nargs = 2, metavar = 'float float',
                  help = 'min max limits in y direction (optional)')
parser.add_option('-z','--zrange',
                  dest = 'zrange',
                  type = 'float', nargs = 2, metavar = 'float float',
                  help = 'min max limits in z direction (optional)')
parser.add_option('-i','--invert',
                  dest = 'invert',
                  action = 'store_true',
                  help = 'invert probability density')
parser.add_option('-r','--rownormalize',
                  dest = 'normRow',
                  action = 'store_true',
                  help = 'normalize probability density in each row')
parser.add_option('-c','--colnormalize',
                  dest = 'normCol',
                  action = 'store_true',
                  help = 'normalize probability density in each column')

parser.set_defaults(bins = (10,10),
                    type = ('linear','linear','linear'),
                    xrange = (0.0,0.0),
                    yrange = (0.0,0.0),
                    zrange = (0.0,0.0),
                   )

(options,filenames) = parser.parse_args()

minmax = np.array([np.array(options.xrange),
                   np.array(options.yrange),
                   np.array(options.zrange)])
grid   = np.zeros(options.bins,'f')
result = np.zeros((options.bins[0],options.bins[1],3),'f')

if options.data is None: parser.error('no data columns specified.')

labels = list(options.data)


if options.weight is not None: labels += [options.weight]                                               # prevent character splitting of single string value

# --- loop over input files -------------------------------------------------------------------------

if filenames == []: filenames = [None]

for name in filenames:
  try:    table = damask.ASCIItable(name = name,
                                    outname = os.path.join(os.path.dirname(name),
                                                           'binned-{}-{}_'.format(*options.data) +
                                                          ('weighted-{}_'.format(options.weight) if options.weight else '') +
                                                           os.path.basename(name)) if name else name,
                                    buffered = False)
  except Exception: continue                                                                        # skip files that cannot be opened
  damask.util.report(scriptName,name)

# ------------------------------------------ read header ------------------------------------------

  table.head_read()

# ------------------------------------------ sanity checks ----------------------------------------

  missing_labels = table.data_readArray(labels)

  if len(missing_labels) > 0:
    damask.util.croak('column{} {} not found.'.format('s' if len(missing_labels) > 1 else '',', '.join(missing_labels)))
    table.close(dismiss = True)
    continue

  minmax = np.array([options.xrange,
                     options.yrange,
                     options.zrange],dtype=float)                                                   # fresh limits per file (auto-scaling and log below modify them)

  for c in (0,1):                                                                                   # check data minmax for x and y (c = 0 and 1)
    if (minmax[c] == 0.0).all(): minmax[c] = [table.data[:,c].min(),table.data[:,c].max()]
    if options.type[c].lower() == 'log':                                                            # if log scale
      table.data[:,c] = np.log(table.data[:,c])                                                     # change x,y coordinates to log
      minmax[c] = np.log(minmax[c])                                                                 # change minmax to log, too

  delta = minmax[:,1]-minmax[:,0]
  (grid,xedges,yedges) = np.histogram2d(table.data[:,0],table.data[:,1],
                                        bins=options.bins,
                                        range=minmax[:2],
                                        weights=None if options.weight is None else table.data[:,2])

  if options.normCol:
    for x in range(options.bins[0]):
      total = np.sum(grid[x,:])                                                                     # avoid shadowing the builtin sum
      if total > 0.0:
        grid[x,:] /= total
  if options.normRow:
    for y in range(options.bins[1]):
      total = np.sum(grid[:,y])
      if total > 0.0:
        grid[:,y] /= total

  if (minmax[2] == 0.0).all(): minmax[2] = [grid.min(),grid.max()]                                   # auto scale from data
  if (minmax[2] == 0.0).all():                                                                       # no data in grid? (must be checked before widening the range)
    damask.util.croak('no data found on grid...')
    minmax[2,:] = np.array([0.0,1.0])                                                                # making up arbitrary z minmax
  elif minmax[2,0] == minmax[2,1]:                                                                   # avoid zero-width z range
    minmax[2,0] -= 1.
    minmax[2,1] += 1.
  if options.type[2].lower() == 'log':
    grid = np.log(grid)
    minmax[2] = np.log(minmax[2])

  delta[2] = minmax[2,1]-minmax[2,0]

  for x in range(options.bins[0]):
    for y in range(options.bins[1]):
      result[x,y,:] = [minmax[0,0]+delta[0]/options.bins[0]*(x+0.5),
                       minmax[1,0]+delta[1]/options.bins[1]*(y+0.5),
                       min(1.0,max(0.0,(grid[x,y]-minmax[2,0])/delta[2]))]

  for c in (0,1):
    if options.type[c].lower() == 'log': result[:,:,c] = np.exp(result[:,:,c])

  if options.invert: result[:,:,2] = 1.0 - result[:,:,2]

# --- assemble header -------------------------------------------------------------------------------

  table.info_clear()
  table.info_append(scriptID + '\t' + ' '.join(sys.argv[1:]))
  table.labels_clear()
  table.labels_append(['bin_%s'%options.data[0],'bin_%s'%options.data[1],'z'])
  table.head_write()

# --- output result ---------------------------------------------------------------------------------

  table.data = result.reshape(options.bins[0]*options.bins[1],3)
  table.data_writeArray()

  table.close()